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L REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/941,992 recorded 
November 16, 2001, at Reel 012176 and Frame 0450. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as M PRO1009". There exist two related patent applications, (1) U.S. Patent Application 
Serial No. 09/991,181, now Patent No. 6, 913, 919, issued 07-05-2005 (containing claims 
directed to nucleic acids encoding PRO1009 polypeptides), and (2) U.S. Patent Application 
Serial No. 09/993,687, filed November 14, 2001 (containing claims directed to PRO 1009 
polypeptides). The related U.S. Patent Application Serial No. 09/993,687 application is also 
under final rejection by the same Examiner and based upon the same outstanding rejections is 
being appealed independently and concurrently herewith. 

III. STATUS OF CLAIMS 

Claims 1 19-123 are in this application. 
Claims 1-118 and 124 have been canceled. 

Claims 1 19-123 stand rejected and Appellants appeal the rejection of these claims. 

IV. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 

Previously, in response to a Final Office Action mailed on September 16, 2005, a Notice 
of Appeal was filed on December 14, 2005 and an Appeal Brief was filed on February 13, 2006. 
An Examiner's Answer was mailed on May 2, 2006 and an RCE Response, with additional 
references and affidavits supporting Appellants' arguments, was filed on June 29, 2006. 
Thereafter, two Final Office Actions were mailed: one on September 21, 2006 and the other on 
October 13, 2006. A Notice of Appeal was filed March 13, 2007. 

No claim amendments have been submitted after the last final rejection of October 13, 

2006. 

A copy of the rejected claims in the present Appeal is provided in Section VIII. 
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V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated antibody and 
antibody fragments that specifically binds to the polypeptide of SEQ ID NO: 194 (Claim 119), 
referred to in the present application as "PRO1009". The invention is further directed to 
monoclonal antibodies (Claim 120), humanized antibodies (Claim 121), antibody fragments 
(Claim 122) and labeled antibodies (Claim 123) that specifically bind to the polypeptide of SEQ 
ID NO: 194. The PRO1009 gene was shown for the first time in the present application to be 
significantly amplified 2.085 fold to 4.287-fold in twelve different colon primary tumors as 
compared to normal, non-cancerous human tissue controls (Example 170). 

Support for the preparation and uses of antibodies is found throughout the specification, 
including, for example, pages 390-395. The preparation of antibodies is described in Example 
144, while Example 145 describes the use of the antibodies for purifying the polypeptides to 
which they bind. Isolated antibodies are defined in the specification at page 315, line 3 1 . 
Support for monoclonal antibodies is found in the specification at, for example, page 390, line 
17, to page 392, line 3. Support for humanized antibodies is found in the specification at, for 
example, page 392, line 4, to page 393, line 6. Support for antibody fragments is found in the 
specification at, for example, page 314, line 30 onwards. Support for labeled antibodies is found 
in the specification at, for example, page 316, lines 3. 

The amino acid sequence of the native "PRO 1009" polypeptide and the nucleic acid 
sequence encoding this polypeptide (referred to in the present application as "DNA57129-1413") 
are shown in the present specification as SEQ ID NOs: 194 and 193, respectively, and in 
Figures 121 and 122, described on pages 293, lines 23-26. The full-length PRO 1009 polypeptide 
having the amino acid sequence of SEQ ID NO: 194 is described in the specification at, for 
example, on page 15 and pages 1 16-1 18 and the isolation of cDNA clones encoding PRO 1009 of 
SEQ ID NO: 194 is described in Example 51, page 444-445 of the specification. 

Finally, Example 170, in the specification at page 539, line 19, to page 555, line 5, sets 
forth a 'Gene Amplification assay' which shows that the PRO 1009 gene is amplified in the 
genome of certain human colon cancers (see Table 9B, page 552-553). The profiles of various 
primary colon tumors used for screening the PRO polypeptide compounds of the invention in the 
gene amplification assay are summarized on Table 8, page 546 of the specification. 
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VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1. Whether Claims 1 19-123 satisfy the utility/ enablement requirement under 35 
U.S.C. §§101/112, first paragraph. 

VII. ARGUMENTS 
Summary of the Arguments: 
Issue 1: Utility/ Enablement 

As a preliminary matter, Appellants note that the Examiner refers to the microarray assay 
in several instances in several Office Actions, and cites references like Lilley et ai, King et al. 9 
Wildsmith et aL, Chen et al etc. which use and analyze the microarray assay. On the other hand, 
Appellants rely upon the gene amplification data of the PRO 1009 gene for patentable utility of 
the PRO 1009 polypeptides and their antibodies in the present application, and would like to 
distinguish between the gene amplification and the microarray assay. The gene amplification 
assay measures the level at which a certain gene (i.e. DNA) is amplified in the genome, whereas 
the microarray assay measures the level of expression of a mRNA encoding for a certain 
polypeptide in a sample. Throughout prosecution, the Examiner fails to distinguish between 
these two techniques, but Appellants submit that the two assays, although similar, are not the 
same. Appellants would like to bring to the Examiner's attention a recent decision in a 
microarray case by the Board of Patent Appeals and Interferences (Decision on Appeal No. 
2006-1469). In its decision, the Board reversed the utility rejection, acknowledging that "there is 
a strong correlation between mRNA levels and protein expression, and the Examiner has not 
presented any evidence specific to the PR01866 polypeptide to refute that." (Page 9). 
Appellants submit that, in the instant application, the Examiner has likewise not presented any 
evidence specific to the PRO1009 polypeptide to refute Applicant's assertion of a correlation 
between DNA levels, mRNA levels and protein expression. Appellants add that they analyze the 
microarray assay only in response to the Examiner's cited references. 

Gene amplification data of the PRO 1009 gene is clearly disclosed in the instant 
specification in Example 170 which discloses that the gene encoding PRO 1009 showed 
significant amplification, ranging from 2.085 fold to 4.287-fold in twelve different colon 
primary tumors . Example 170 in the instant specification discloses that, "(a)mplification is 
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associated with overexpression of the gene product, indicating that the polypeptides are useful 
targets for therapeutic intervention in certain cancers such as colon, lung, breast and other 
cancers and diagnostic determination of the presence of those cancers" (emphasis added). 
Appellants have submitted, in their Response filed August 4, 2005, a Declaration by Dr. Audrey 
Goddard, which explains that a gene identified as being amplified at least 2-fold by the disclosed 
gene amplification assay in a tumor sample relative to a normal sample is useful as a marker for 
the diagnosis of cancer , and for monitoring cancer development and/or for measuring the 
efficacy of cancer therapy. Therefore, such a gene is useful as a marker for the diagnosis of 
colon cancer , and for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

Appellants have also submitted, in their Responses filed June 18, 2004, and June 29, 
2006, ample evidence to show that, in general, if a gene is amplified in cancer, it is more likely 
than not that the encoded protein will be expressed at an elevated level. For instance, the articles 
by Orntoft et al. 9 Hyman et al. 9 and Pollack et al collectively teach that in general gene 
amplification increases mRNA expression . Second, the Declarations of Dr. Paul Polakis: 
(Polakis I and II), show that, in general there is good correlation between mRNA levels and 
polypeptide levels . Third, Appellants further submit that even if there were no correlation 
between gene amplification and increased mRNA/protein expression, (which Appellants 
expressly do not concede to), a polypeptide encoded by a gene that is amplified in cancer would 
still have a specific, substantial, and credible utility. Appellants submit that, as evidenced by the 
Ashkenazi Declaration and the teachings of Hanna and Mornin (both made of record in 
Appellants' Response filed June 18, 2004), simultaneous testing of gene amplification and gene 
product over-expression enables more accurate tumor classification , even if the gene-product, the 
protein, is not over-expressed. This leads to better determination of a suitable therapy for the 
tumor, as demonstrated by a real-world example of the breast cancer marker HER-2/neu. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not informative of 
the protein level). Therefore, as a general rule, one skilled in the art would find it more likely 
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than not that PRO 1009 and its antibodies are useful as a diagnostic tools for detecting colon 
tumors. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PRO1009 polypeptides and antibodies binding to the PRO1009 polypeptides. 
Accordingly, one of ordinary skill in the art would also understand how to make and use the 
recited antibodies for the diagnosis of colon cancer without any undue experimentation. 

These arguments are all discussed in greater detail below, under their appropriate 
headings. 

Response to Rejections 

ISSUE 1. Claims 119-123 are Supported by a Credible, Specific and Substantial Asserted 
Utility, and Thus Meet the Utility Requirement of 35 U.S.C. $101 and the "How to Use 
Prong" of the Enablement Requirement of 35 U.S.C. $112, First Paragraph 

The sole basis for the Examiner's rejection of Claims 1 19-123 under these sections is that 
the data presented in Example 170 of the present specification is allegedly insufficient under 
applicable legal standards to establish a patentable utility under 35 U.S.C. §101 for the presently 
claimed subject matter, and further, since a patentable utility has not been established, one would 
not know how to use the claimed invention. 

Appellants strongly disagree and respectfully traverse the rejection. 

A. The Legal Standard For Utility Under 35 U.S.C. $101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent Applicant disclose a "substantial utility" for his or 



1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 
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her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 
commerce rather than the realm of philosophy." 

Later, in Nelson v. Bowler* the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 , ' 

In Cross v. Iizuka, 6 the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 
i.e., there is a reasonable correlation there between." 7 The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 



2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. Iizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 
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The case law has also clearly established that Appellants 1 statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

10 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Appellant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope." ' 

Compliance with 35 U.S.C. §101 is a question of fact.' 3 The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines"),' 5 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 

Ibid. 

" In re Longer, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In reJolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
USPQ 351 (1965); In re Sichert, 566F.2d 1154, 1159, 196 USPQ 209, 212-13 (C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert, denied, 469 
US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 

15 66 Fed. Reg. 1092 (2001). 
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a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial' 'utility.'" 16 Indeed, the Guidelines for Examination of 
Applications for Compliance With the Utility Requirement, 17 gives the following instruction to 
patent examiners: "If the Applicant has asserted that the claimed invention is useful for any 
particular practical purpose ... and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants submit that the evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
Appellant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
skill in the art would doubt the truth of the statement of utility. Only after the Examiner has 
made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
Appellant. 

Appellants respectfully submit that the data presented in Example 170 starting on page 
539 of the specification and the cumulative evidence of record support a "specific, substantial 
and credible" asserted utility for the presently claimed invention. 

Patentable utility for the PRO 1009 polypeptides and their antibodies is based upon the 
gene amplification data for the gene encoding the PRO1009 polypeptide of SEQ ID NO: 194. 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 11(B)(1). 

-9- 

On Appeal to the Board of Patent Appeals and Interferences 

Appellants' Brief 
Application Serial No. 09/990,443 
Attorney's Docket No. 39780-2730 P1C12 



Example 170 describes the results obtained using a very well-known and routinely employed 
polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred to herein 
as the gene amplification assay. This assay allows one to quantitatively measure the level of 
gene amplification in a given sample, say, a tumor extract, or a cell line. It was well known in 
the art at the time the invention was made that gene amplification is an essential mechanism for 
oncogene activation. Appellants isolated genomic DNA from a variety of primary cancers and 
cancer cell lines that are listed in Table 9 (pages 539 onwards of the specification), including 
primary colon cancers of the type and stage indicated in Table 8 (page 546). The tumor samples 
were tested in triplicates with Taqman™ primers and with internal controls, beta-actin and 
GADPH in order to quantitatively compare DNA levels between samples (page 548, lines 33- 
34). As a negative control, DNA was isolated from the cells of ten normal healthy individuals, 
which was pooled and used as a control (page 539, lines 27-29) and also, no-template controls 
(page 548, lines 33-34). The results of TaqMan™ PCR are reported in ACt units, as explained in 
the passage on page 539, lines 37-39. One unit corresponds to one PCR cycle or approximately 
a 2-fold amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold 
amplification and so on. Using this PCR-based assay, Appellants showed that the gene encoding 
for PRO 1009 was amplified, that is, it showed approximately 1.06-2.10 ACt units which 
corresponds to 2 106 -2 2 * 10 - fold amplification or 2.085 fold to 4.287-fold in twelve different 
colon primary tumors . 

Appellants submitted a Declaration by Dr. Audrey Goddard which provides a statement 
by an expert in the relevant art that "fold amplification" values of at least 2-fold are considered 
significant in the TaqMan™ PCR gene amplification assay. Appellants particularly draw the 
Board's attention to page 3 of the Goddard Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy number 
in the tumor sample relative to the normal sample serves as a basis for using 
relative gene copy number as quantitated by the TaqMan PCR technique as a 
diagnostic marker for the presence or absence of tumor in a. tissue sample of 
unknown pathology. Accordingly, a gene identified as being amplified at least 2- 
fold by the quantitative TaqMan PCR assay in a tumor sample relative to a normal 
sample is useful as a marker for the diagnosis of cancer, for monitoring cancer 
development and/or for measuring the efficacy of cancer therapy. 
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(Emphasis added). 



Accordingly, the 2.085 fold to 4.287-fold in twelve different colon primary tumors 
would be considered significant and credible by one skilled in the art, based upon the facts 
disclosed in the Goddard Declaration. 

Further, as discussed in detail below, Appellants have provided ample evidence in the 
form of articles from the art, like Orntoft et al, Hyman et al, Pollack et al, and over a 100 
references (see Evidence List items 14-145) and Declarations by experts in the field of oncology 
and gene expression, i.e.: the Declarations by Dr. Paul Polakis (I and II) and by Dr. Avi 
Ashkenazi , to show that, in general, if a gene is amplified in cancer, it is "more likely than not" 
that the encoded protein will also be expressed at an elevated level. 

C. A prima facie case of lack of utility has not been established 

As discussed above, the increase in DNA copy number for the PRO 1009 gene is 
significant. Further, the evidentiary standard to be used throughout ex parte examination of a 
patent application is a preponderance of the totality of the evidence under consideration. Thus, 
to overcome the presumption of truth that an assertion of utility by the applicant enjoys, the 
Examiner must establish that it is more likely than not that one of ordinary skill in the art would 
doubt the truth of the statement of utility. 

Accordingly, it is not a legal requirement to establish a necessary correlation between an 
increase in the copy number of the DNA and protein expression levels that would correlate to the 
disease state or that it is imperative to find evidence that DNA amplification is " necessarily " or 
"always" associated with overexpression of the gene product. Appellants respectfully submit 
that when the proper evidentiary standard is applied, a correlation must be acknowledged. Only 
after the Examiner has made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. 

Previously, the Examiner has indicated, based on references Pennica et al, Konopka et 
al and Haynes et al, that gene amplification data cannot reliably predict protein levels. 
Appellants have argued the references in great detail throughout prosecution, and these 
arguments are incorporated by reference herein for brevity. Appellants summarize the rejections 
and the arguments submitted below. 
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The teachings of Pennica et al are specific to WISP genes, a specific class of closely 
related molecules. Pennica et al showed that there was good correlation between DNA and 
mRNA expression levels for the WISP-1 gene but not for WISP -2 and WISPS genes. WISPs 1- 
3 have no structural relationship to the PRO 1009 polypeptides of the present application. The 
apparent finding that for two out of three specific molecules, that are related to each other but 
have no relationship to PRO 1009, there was no correlation between gene amplification and the 
level of mRN A/protein expression does not establish, in general, that it is more likely than not 
that such correlation does not exist, and has no bearing whatsoever on determining the question 
whether such correlation is likely to exist between PRO1009 gene amplification and 
mRNA/protein expression levels. As discussed above, the standard is not absolute certainty . 
Pennica et al has no teaching whatsoever about the correlation of gene amplification and protein 
expression for genes in general or PRO 1009 or related molecules in particular. 

Similarly, in Konopka et al, the Examiner has generalized a very specific result disclosed 
by Konopka et al to cover all genes. Konopka et al actually state that "[pjrotein expression is 
not related to amplification of the abl gene but to variation in the level of bcr-ab\ mRNA 
produced from a single Ph 1 template." (See Konopka et al Abstract, emphasis added). The 
paper does not teach anything whatsoever about the correlation of protein expression and gene 
amplification in general and provides no basis for the generalization that apparently underlies 
the present rejection. The statement of Konopka et al that "[pjrotein expression is not related to 
amplification of the abl gene ..." is not sufficient to establish a prima facie case of lack of 
utility; Therefore, the combined teachings of Pennica et al and Konopka et al are not directed 
towards genes in general but to a single gene or genes within a single family and thus, their 
teachings cannot support a general conclusion regarding correlation between gene amplification 
and mRNA or protein levels. In addition, the abl gene has no structural relationship to the 
PRO 1009 gene of the present application and thus Konopka et al provides no information of 
specific relevance to the question whether for PRO 1009 there is a reasonable expectation that 
correlation between gene amplification and mRNA/protein expression levels is likely to exist. 

The Examiner also cited Haynes et al to show that transcript levels and protein levels do 
not correlate. However, Appellants had shown that Haynes et al, themselves admit that " there 
was a general trend, although no strong correlation between protein [expression] and transcript 

-12- 

On Appeal to the Board of Patent Appeals and Interferences 

Appellants' Brief 
Application Serial No. 09/990,443 
Attorney's Docket No. 39780-2730 P1C12 



levels." (see Figure 1 and page 1863, paragraph 2.1, last line). Therefore, when the proper legal 
standard is used, Haynes clearly supports the Appellants' position that in general, a positive 
correlation exists between mRNA and protein expression levels . Since accurate predicti on is not 
the standard , a prima facie case of lack of utility has not been met based on the cited references 
Pennica et al, Konopka et al and Haynes et al Appellants respectfully . submit that, contrary to 
the Examiner's assertion, none of the cited reference conclusively establish a prima facie case 
for lack of utility for the PRO 1 009 molecule. 

Appellants have already discussed references Hu et al, et al, Chen et al, Lian et al, 
Fessler et al, in great detail in their previous responses (see Appeal brief filed February 13, 
2006), and these arguments are hereby incorporated by reference for brevity. 

Briefly, the analytical methods utilized by Hu et al have certain statistical drawbacks, as 
the authors themselves admit, and Hu et aVs conclusions only apply to a specific type of breast 
tumor (estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast cancer 
genes in general, let alone to cancer genes in general. 

Regarding Lian et al, Appellants respectfully submit that Lian et al only teach that 
protein expression may not correlate mRNA level in differentiating myeloid cells, and not in 
genes in general In fact, the authors themselves admit that there were a number of problems 
with their data . For instance, at page 520 of this article, the authors explicitly express their 
concerns regarding the methods they utilized and the interpretation of their data stating that 
" |Y|hese data must be considered with several caveats: membrane and other hydrophobic proteins 
and very basic proteins are not well displayed bv the standard 2DE approach, and proteins 
presented at low level will be missed. In addition, to simplify MS anlvsis, we used a Coomassie 
dye stain rather than silver to visualize proteins, and this decreased the sensitivity of detection of 
minor proteins. " (Emphasis added). Besides, Lian et al.'s conclusions are based on the 
Coomassie dye staining method, which is not a very sensitive method of measuring protein. 

Similarly, in Fessler et al, examined lipopoysaccharide-activated neutrophilia, in 
response to LPS stimulation. Fessler et al also used the Coomassie Blue dye staining method, 
and concede that it is known to have a limited protein binding range and a non-linear curve for 
protein detection. Protein identification in their study was also done using two-dimensional 
PAGE, which is, by their own admission, is limited only to well-resolved regions of the gel and 
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therefore, may have performed less well with hydrophobic and high molecular weight proteins 
(seepage 31301, col. 1). 

"in addition, Chen et al also concede that there are problems with 2D gel protein 
detection and therefore, cannot accurately predict protein levels . For instance, Chen et al says 
that, "(i)t is apparent that without prior enrichment only a relatively small and highly selected 
population of long-lived, highly expressed proteins is observed. There are many more proteins in 
a given cell which are not visualized by such methods. Frequently it is the low abundance 
proteins that execute key regulatory functions " (page 1870, col. 1, emphasis added). Thus, Chen 
et al, concede that by selecting proteins visualized by 2D gels, they are likely to have excluded 
in their analysis many key regulatory proteins which could be candidate cancer markers. 

Further, the microarray analysis provided by Chen et al in fact support the Appellants 
general proposition that, even if protein levels cannot be accurately predicted (which is not 
required by the utility standard), in the majority of the proteins studied, it is most likely than not 
that an increase in gene amplification or mRNA levels generally correlates well with increased 
protein levels. In Table 1, which lists 66 genes [the paper incorrectly states there are 69 genes 0 
listed] for which only one protein isoform is expressed, Chen et al show that 40 genes out of 66 
had a positive correlation between mRNA expression and protein expression , which clearly 
meets the standard for "more likely than not". Similarly, in Table II , 22 genes out of 30 [again 
the paper incorrectly states there are 29] and at least one isoform showed a positive correlation 
between mRNA expression and protein expression. Furthermore, 12 genes out of 29 showed a 
strong positive correlation [as determined by the authors] for at least one isoform. Therefore, the 
authors of the Chen paper clearly agreed that microarrays provided a reliable measure of the 
expression levels of the gene and could be used to identify genes whose overexpression is 
associated with tumors. 

In summary, Hu et al, Lian et al Chen et al, and Fessler et al do not conclusively teach 
that, in general, protein levels cannot be accurately predicted from mRNA/ gene amplification 
levels. These authors concede that either due to insensitive protein detection methods or due 
their methodologies utilized in their protocols, some protein species may have been 
underrepresented over others. Therefore, the teachings of these references cannot be relied upon 
to establish a prima facie showing of lack of utility. On the other hand, as noted even in Haynes 
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et al and Chen et al, most genes showed a positive correlation between increased gene 
amplification, mRNA and translated protein. 

Alberts and Lewin et al (Items 15 and 16 of the Evidence List) 

The Examiner acknowledges that "the teachings of Alberts and Lewin (which) disclose 
that initiation of transcription is a common point for a cell to regulate the gene expression" but 
asserts that "it is not the only means of regulating gene expression". (Final Office Action of 
September 21, 2006). 

Appellants respectfully disagree and submit that the utility standard is not absolute 
certainty. Therefore, Appellants do not need to establish that the production of RNA is 
inevitably equated with production of protein in order to meet the utility standard. Instead, as 
long as it is more likely than not that a change of the transcription level of a gene gives rise to a 
change in translation level of a gene, the utility standard is met. 

Meric et al (Item 92 of the Evidence List) 

The Examiner asserts that Meric teaches that "gene expression is quite complicated and is 
also regulated at the level of mRNA stability, mRNA translation, and protein stability" (Final 
Office Action of September 21, 2006). 

Appellants respectfully submit that Meric simply summarizes the translation regulation of 
cancer cells. Meric indicates that translation initiation is regulated in response to nutrient 
availability and mitogenic stimulation and is coupled with cell cycle progression and cell growth. 
Meric further discusses that alteration in translation control occur in cancer. For example, 
variant mRNA sequences can alter the translational efficiency of individual mRNA molecule. 
(See Abstract). Meric further teaches that the changes of the translational efficiency of a mRNA 
transcript depend on the mutation of a specific mRNA sequence. (Page 973, column 2 to 
page 91556, column 1). Meric never suggest that the translation of a cancer gene is suppressed 
in cancer in general and therefore, an increased mRNA levels will not yield an increased protein 
levels. To the contrary, Meric teaches that the translation efficiency of a number of cancer genes 
is enhanced in cancer cells compared to its normal counterpart. For instance, in a patient with 
multiple myeloma, a C-T mutation in the c-myc IRES was identified and found to cause an 
enhanced initiation of translation. (Page 91556, column 1). Therefore, the level of proteins 
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encoded by these genes increases in cancer cells at an even higher magnitude than the mRNA 
level. 

Appellants emphasize that it is not a legal requirement to establish an absolute correlation 
between an increase in the mRNA level and protein expression levels that would correlate to the 
disease state nor is it imperative to find evidence that protein levels can be accurately predicted. 
Therefore, the Examiner has misinterpreted the teaching of Meric and applied improperly high 
evidentiary standard. Therefore, the Examiner has failed to establish a prima facie showing of 
lack of utility based on the rejection of Meric et al. 

Godbout et al. (Item 46 of Evidence List) 

The Examiner asserts that of the references cited by Appellants in their IDS filed June 29, 
2006, only "Godbout is drawn to the predictability of protein levels based upon genomic DNA 
amplification" (Final Office Action of October 1 3, 2006). 

Appellants respectfully submit that, as discussed in the RCE response of June 29, 2006, 
there are several other references; for instance, Bea et al. investigated gene amplification, mRNA 
expression, and protein expression of the putative oncogene BMI-1 in human lymphoma 
samples, and supports Appellants' assertion that gene amplification is correlated with both 
increased mRNA and protein expression. 

The Examiner further asserts that unlike Godbout et al, the instant specification does not 
teach structure/ function analysis and the Examiner questions whether the level of genomic 
amplification of DDX1 gene is comparable to that disclosed by PRO1009. 

Appellants respectfully submit that it was never claimed that PRO1009 is similar in any 
way to the DDX1 gene of Godbout et al.; in fact the Godbout reference was submitted to show 
good correlation between protein levels based upon genomic DNA amplification, which the 
Examiner clearly agrees with. Structure/function data, which the Examiner requests, is not a 
requirement for the utility requirement, hence this rejection is improper. 

Lillev et al.. King et al. and Wildsmith et al. 

The Examiner cites Lilley et al. to show that "the extrapolation that changes in transcript 
level will also result in corresponding changes in protein amount or activity cannot always be 
made". (Final Office Action of September 2 1 , 2006). 
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Appellants repeat that it is not a legal requirement for utility, to establish a necessary 
correlation between an increase in the mRNA level and protein expression levels, or to show that 
changes in transcript level should always result in corresponding changes in protein amount or 
activity. Accordingly, the question is not whether a correlation between an increase in mRNA 
and protein expression levels always exists, rather if it is more likely than not that a person of 
ordinary skill in the pertinent art would recognize such a positive correlation. Nowhere in the 
Lilley paper does the author suggest that it is more likely than not that altered mRNA levels 
does not correlate with altered protein levels. On the contrary, the statement that "changes in 
transcript level will also result in corresponding changes in protein amount or activity cannot 
always be made" implies that the mRNA/protein correlation exists in most cases. 

The Examiner cites King et al to show that "mRNA levels do not necessarily correlates 
with protein levels". (Final Office Action of September 21, 2006). 

First of all, Appellants note that the instant application is not directed to the microarray 
assay as in King et al. As discussed above, the utility standard is not absolute certainty. 
Therefore, Appellants should not be required to show a "necessary correlation" between mRNA 
and protein levels in order to establish a patentable utility. King never indicates that it is more 
likely than not that a general correlation between the mRNA and protein levels for a gene does 
not exist, this paper alone does not suffice to establish a prima facie showing of lack of utility. 

Appellants also note that the author discussed numerous advantages of the microarray 
technology, which offers tremendous advantages in the study of human diseases. For instance, 
on page 2287, the author states that "microarrays can be expected to prove extremely valuable as 
tools for the study of the generic basis of complex diseases. The ability to measure expression 

profiles across entire genomes provides a level of information not previously attainable 

Microarrays make it possible to investigate differential gene expression in normal vs. diseased 
tissue, in treated vs. non-treated tissue, and in different stages during the natural course of the 
disease, all on a genomic scale. Gene expression profiles may help to unlock the molecular basis 

of phenotype, response to treatment, and heterogeneity of disease " Therefore, if anything, 

the King reference supports the use of the microarray in the diagnosis of human diseases, which 
silently assumes that, most probably, increases in mRNA levels correlate well with increases in 
protein levels which in turn impacts disease. 
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Similarly, Wildsmith never indicated that it is more likely than not that a general 
correlation between the mRNA and protein levels for a gene does not exist. Therefore, this paper 
is not sufficient to establish a prima facie showing of lack of utility. In fact, the Wildsmith 
paper discusses examples of a number of successes of microarray applications in the detection of 
human diseases (see Page 284). For instance, the author has pointed out that "one area of rapid 
progress using microarray technology is the increased understanding of cancer. Molecular 
pathologies are subgrouping cancers of tissues such as blood, skin, and breast, based on 
differential gene expression patterns. For example, within a small group of breast cancer tissue 
samples, Perou et al distinguished two broad subgroups representing those expressing or 
alternatively lacking expression of the oestrogen receptor- y- gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared with the previous 
methods of gene amplification. Another example of the impact of this technology is in the 
identification of two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran et 
al, 2001). Microarray technology has also accelerated the understanding of the molecular events 
surrounding pulmonary fibrosis. Specially, two distinct clusters of genes associated with 
imflammation and fibrosis have been identified in a disease where, for years, the pathogenesis 
and treatment have remained unknown (Katsuma et al, 2001)." 

Therefore, collectively, the references Lilley et al, Wildsmith etal, and King et al 
references cited by the Examiner show that the art indicates that, generally, if a mRNA is 
overexpressed in cancer, it is more likely than not that the encoded protein will also be expressed 
at an elevated level. 

Madoz Gurpide et al and Celis et al 

The Examiner cites Madoz-Gurpide et al and Celis et al to show that one cannot 
accurately predict protein levels based on mRNA levels (made of record by the Examiner in the 
Final Office Action mailed September 2 1 , 2006). 

Madoz-Gurpide et al explains that mRNA expression alone does not provide 
information regarding the "activation state, post-translational modification or localization of 
corresponding proteins" (emphasis added; page 168, col. 1). That is, Madoz-Gurpide et al 
explain that mechanisms are not apparent from mRNA expression alone. Madoz Gurpide et al 
further state that,' it is "unclear" how well the reported RNA levels correlate with protein levels. 
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In support of this assertion, the authors cite only a single reference, namely, the Chen et al, 
which was discussed above and in detail in the Appeal Brief filed February 13, 2006. Madoz 
Gurpide et al also acknowledge that the DNA microarray studies, such as those carried out by 
Beer et al (specifically cited by the authors at page 52), "justify the use of this technology for 
uncovering patterns of gene expression that are clinically informative" (emphasis added; 
page 53). Thus, while Madoz-Gurpide et al note that it is "more difficult to develop an 
understanding of disease at a mechanistic level using DNA microarrays," (emphasis added; 
page 53), Appellants respectfully point out that that "understanding of a disease at the 
mechanistic level" is not relevant to Appellants' assertions of utility, as discussed above. 
Accordingly, a prima facie case cannot be made based on the teachings within the Madoz- 
Gurpide et al reference. 

Like Madoz-Gurpide et al, Celis et al discuss that mechanisms are not apparent from 
mRNA expression alone. Celis et al note that "proteomics addresses problems that cannot be 
approached by DNA analysis, namely, relative abundance of the protein product, post- 
translational modification, subcellular localization, turnover, interaction with other proteins as 
well as functional aspects" (page 6, col. 2). However, in their discussion, Celis et al cite Orntoft 
et al (Item 103 of Evidence List) and note that "in most cases there was a good correlation 
between transcript and protein levels." Celis et al further explain that those few cases which 
showed apparent discrepancies may have been due to other causes, such as post-transcriptional 
processing or degradation of the protein, or the choice of methods used to assess protein 
expression levels. Celis et al note that, the observation that, there is often more change in 
mRNAs as compared to the proteins may be due to the fact that current technologies detect 
mainly high abundance proteins, while most of the changes affecting protein levels may involve 
low abundance proteins. Thus, the correlation between mRNA and protein levels may be even 
higher than typically observed, given these factors. Celis et al explain that proteomics is useful 
in combination with arrays "for the entire process of drug development and evaluation." (page 6; 
col. 1). Appellants further submit that significant correlations between gene and protein 
expression are most likely to be observed for genes associated with cancer, since as Celis et al 
note, "transformation resulted in the abnormal expression of normal genes, rather than in the 
expression of new ones" (page 11, col. 1). Accordingly, alterations in gene amplification or 
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expression are more likely to be associated with altered protein expression in the case of cancer 
than in other cases where DNA microarrays are used, because, as explained by Celis et al, the 
alterations in expression levels of certain normal proteins are part of the process that leads to 
cancer. 

Accordingly, a prima facie case cannot be made based on the teachings within the Celis 
et al reference. In fact, Appellants submit that Celis et al support the Appellants contention that 
it is more likely than not that changes in mRNA levels reflect changes in protein levels, in 
general. 

Nagaraia et al. Waghray et aU Saganaliev et al 

The Examiner also asserts that "[c]omprehensive studies where significantly large 
numbers of transcripts and proteins were examined report that increases in mRNA and protein 
samples are not correlated." (Final Office Action of September 21, 2006). In support of this 
assertion, the Examiner cites three new references, by Nagaraja et al, Waghray et al, and 
Sagynaliev et al 

The Examiner cites Nagaraja et al as allegedly teaching that "the proteomic profiles 
indicated altered abundance of fewer proteins as compared to transcript profiles." (Final Office 
Action of September 2 1 , 2006). 

Appellants respectfully submit that the fact that many more transcripts than proteins were 
found to be differentially expressed, does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only 
measured for many fewer proteins than transcripts . Tn particular, the total number of proteins 
whose expression levels could be visualized on silver-stained gels was only about 300 (page 
2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for which 
mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many fewer 
proteins than transcripts were measured, it is hardly surprising that a smaller absolute number of 
proteins than mRNAs were found to be overexpressed, because the protein products of most of 
the overexpressed mRNAs would not have been among the small number of proteins identified 
on the gels. 

The Examiner next cites Waghray et al, to the effect that "for most of the proteins 
identified, there was no appreciable concordant change at the RNA level," and that "[t]he change 
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in intensity for most of the affected proteins identified could not be predicted based on the level 
of the corresponding RNA " (Final Office Action of September 21, 2006). Appellants reiterate 
that they need only show that there is a correlation between mRNA and protein levels, such that 
mRNA overexpression generally predict protein overexpression. A showing that mRNA levels 
can be used to "accurately predict" the precise levels of protein expression is not required . 

Appellants also emphasize that Appellants are asserting that a measurable change in 
mRNA level generally leads to a corresponding change in the level of protein expression, not 
that changes in protein level can be used to predict changes in mRNA level. Waghray et al. did 
not take genes which showed significant mRNA changes and check the corresponding protein 
levels . Instead, the authors looked at a small and unrepresentative number of proteins, and 
checked the corresponding mRNA levels. Waghray et al. acknowledge that only "[a] relatively 
small set of genes could be analyzed at the protein level, largely due to the limited sensitivity of 
2-D PAGE" (page 1337, col. 1). In particular, while the authors examined the expression levels 
ofl 6,570 genes (page 1329, col. 2), they were able to measure the expression levels of only 1031 
proteins (page 1333, col. 2). Waghray et al does not teach that changes in mRNA expression 
were not correlated with changes in expression of the corresponding protein. All Waghray et al 
state is that "for most of the proteins identified, there was no appreciable concordant change at 
the mRNA level" (page 1337, col. 2). This statement is not relevant to Appellants' assertion of 
utility, since Appellants are not asserting that changes in mRNA levels are the only cause of 
changes in protein levels. Waghray et al do not contradict Appellants' assertion that changes in 
mRNA expression, in general, correspond to changes in expression of the corresponding protein. 

Lastly, the Examiner cites Sagynaliev et al, as allegedly teaching that "it is also difficult 
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes 
that according to Sagynaliev et al, of 982 genes found to be differentially expressed in human 
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Final Office Action 
of September 21, 2006). 

The Sagynaliev et al. reference, titled " Web-based data warehouse on gene expression in 
human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of gene 
expression data published in human CRC , and not from experimental data. While a literature survey 
can be a useful tool to assist researchers, the results may greatly over-represent or under-represent 
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certain genes, and thus the conclusions may not be generally applicable. In particular, Appellants 
note that, as evidenced by Lian et al, Nagaraja et al, and Waghray et al, discussed above, the 
number of mRNAs examined in transcriotomics studies is typically much larger than the number of 
proteins examined in corresponding proteomics studies , due to the difficulties in detecting and 
resolving more than a small minority of all expressed proteins on 2D gels. Thus the fact that only 
18% of all genes found to be differentially expressed in human CRC have been confirmed using 
proteomics technologies does not mean that the corresponding proteins are not also differentially 
expressed, but is most likely due to the fact that the corresponding proteins were not identified on 2D 
gels, and thus their expression levels remain unknown. 

The authors of Sagynaliev et al acknowledge the many technical problems in finding 
proteomic data for CRC that can be matched to transcriptomic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 
protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left 
column, second paragraph; see also, Table 2.) , 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known 
technological limitations ... even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

Li et al 

The Examiner cites new reference Li et al as teaching that "68.8% of the genes showing 
over-representation in the genome did not show elevated transcript levels/' (Final Office Action 
,of October 13,2006). 

Appellants respectfully point out that Li et al acknowledge that their results differed 
from those obtained by Hyman et al and Pollack et al (of record), who found a substantially 
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higher level of correlation between gene amplification and increased gene expression. The 
authors note that "[t]his discordance may reflect methodologic differences between studies or 
biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). For 
instance, as explained in the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if they had a copy number ratio of at least 1 .40 . In the case of 
PRO 1009, as discussed in previously filed responses and in the Goddard Declaration (of record), 
an appropriate threshold for considering gene amplification to be significant is a copy number of 
at least 2.0 (which is a higher threshold). The PRO1009 gene showed significant amplification 
of 2,085 fold to 4.287-fold in twelve different colon primary tumors , and thus fully meets this 
standard. It is not surprising that in the Li et al. reference, by using a lower threshold of 1 .4 for 
considering gene amplification, a higher number of genes not showing corresponding increases 
in mRNA expression were found. Moreover, Appellants add that the results of Li et al. do not 
conclusively disprove that a gene with a substantially higher level of gene amplification, such as 
PRO 1009 , would be expected to show a corresponding increase in transcript expression. 

Therefore, the Patent Office has failed to meet its initial burden of proof that Appellants' 
claims of utility are not substantial or credible. The arguments presented by the Examiner based 
on references Pennica et al, Konopka et al, Haynes et al, Hu et al, Chen et al, Lian et al, 
Fessler et al, Lilley et al, King et al, Wildsmith et al, Nagaraja et al, Sagynaliev et al, and 
Waghray et al, Madoz-Gurpide et al, Celis et al, Li et al, do not provide sufficient reasons to 
doubt the statements by Appellants that PRO 1009 has utility as a diagnostic marker for colon 
cancer. Appellants once again remind the Examiner that only after the Examiner has made a 
proper prima facie showing of lack of utility, does the burden of rebuttal shift to the Appellant. 
Based on the above discussions, such a showing has not been made. Accordingly, the instant 
rejection should be withdrawn for the Examiner's lack of establishment of a prima facie 
showing. 

Moreover, Appellants acknowledge that, in certain instances, DNA/ mRNA and protein 
levels do not correlate. In fact, Appellants have included several such references directed 
towards a single gene or genes that lack a correlation in their IDS filed June 29, 2006. The 
references discussed in the Preliminary Amendment filed June 29, 2006, and the arguments 
therein are hereby incorporated by reference for brevity. The IDS included references that 
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studied single genes or gene families, multiple or large family of genes, and included studies that 
a wide variety of techniques, including gene amplification and microarray. Regardless of the 
techniques employed, by and large, increased genes/ transcripts levels mostly correlated with 
increased protein levels, even if accurate predictions of proteins could not be made . As 
discussed throughout prosecution, the law does not require the existence of a "necessary" 
correlation between DNA/mRNA and protein levels, or that protein levels be "accurately 
predicted." In fact, authors in several of the cited references (cited both, by the Examiner, and 
by Appellants) themselves acknowledge that there is a general correlation between protein 
expression and transcript levels and DNA levels, which meets the "more likely than not 
standard." 

In summary, Appellants maintain that, even though certain instances where DNA/ mRNA 
and protein levels do not correlate exist, in most cases , there is generally good correlation 
between DNA/ mRNA and protein levels, and this was collectively demonstrated in the more 
than 100 references submitted by the Appellants in the IDS filed June 29, 2006. 

D. The Gene Amplification Data Establishes Credible, Substantial and Specific 
Patentable Utility for the PRO1009 Polypeptide and its antibodies 

On the other hand, as discussed throughout prosecution, Appellants submit that Example 
170 of the specification further discloses that, "(a)mplification is associated with overexpression 
of the gene product, indicating that the polypeptides are useful targets for therapeutic 
intervention in certain cancers such as lung, colon, breast and other cancers and diagnostic 
determination of the presence of those cancers" (Emphasis added). Appellants have also 
submitted ample evidence to show that, in general, if a gene is amplified in cancer, it is "more 
likely than not" that the encoded protein will also be expressed at an elevated level. 

Besides the references, the Declaration by Dr. Paul Polakis (Polakis I - made of record in 
Appellants' Response filed June 18, 2004), principal investigator of the Tumor Antigen Project 
of Genentech, Inc., the assignee of the present application, explains that in the course of Dr. 
Polakis' research using microarray analysis, he and his co-workers identified approximately 200 
gene transcripts that are present in human tumor cells at significantly higher levels than in 
corresponding normal human cells. Appellants submit that Dr. Polakis' Declaration was 
presented to support the position that there is a correlation between mRNA levels and 
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polypeptide levels, the correlation between gene amplification and mRNA levels. The second 
Declaration by Dr. Polakis (Polakis II) presented evidentiary data in Exhibit B. Exhibit B of the 
Declaration identified 28 gene transcripts out of 31 gene transcripts (i.e., greater than 90%) that 
showed good correlation between tumor mRNA and tumor protein levels. As Dr. Polakis' 
Declaration (Polakis II) says u [a]s such, in the cases where we have been able to quantitatively 
measure both (i) mRNA and (ii) protein levels in both (i) tumor tissue and (ii) normal tissue, we 
have observed that in the vast majority of cases, there is a very strong correlation between 
increases in mRNA expression and increases in the level of protein encoded by that mRNA. ,, 
Accordingly, Dr. Polakis has provided the facts to enable the Examiner to draw independent 
conclusions regarding protein data. Appellants further emphasize that the opinions expressed in 
the Polakis Declaration, including in the above quoted statement, are all based on factual 
findings. For instance, antibodies binding to about 30 of these tumor antigens were prepared, 
and mRNA and protein levels were compared. In approximately 80% of the cases , the 
researchers found that increases in the level of a particular mRNA correlated with changes in the 
level of protein expressed from that mRNA when human tumor cells are compared with their 
corresponding normal cells . Therefore, Dr. Polakis' research, which is referenced in his 
Declaration, shows that, in general there is a correlation between increased mRNA and 
polypeptide levels . Hence, one of skill in the art would reasonably expect that, based on the gene 
amplification data of the PRO 1009 gene, the PRO 1009 polypeptide is concomitantly 
overexpressed in the colon tumors studied as well. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip® arrays in 2004. Clearly, the research community believe that the 
information obtained from these chips is useful (i.e., that it is more likely than not that the results 
are informative of protein levels). 

Thus, based on the asserted utility for PRO 1009 in the diagnosis of selected colon 
tumors, the reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 194 in 
the present application, the disclosure of the step-by-step protocols for making chimeric PRO 
polypeptides, including those wherein the heterologous polypeptide is an epitope tag or an Fc 
region of an immunoglobulin in the specification (at page 374, lines 24 to page 375, line 9), the 
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disclosure of a step-by-step protocol for making and expressing PRO 1009 in appropriate host 
cells (in Examples 140-143 and page 376, line 12), the step-by-step protocol for the preparation, 
isolation and detection of monoclonal, polyclonal and other types of antibodies against the 
PRO1009 protein in the specification (at pages 390-395) and the disclosure of the gene 
amplification assay in Example 170, the skilled artisan would know exactly how to make and use 
the claimed polypeptide and its antibodies for the diagnosis of colon cancers. Appellants submit 
that based on the detailed information presented in the specification and the advanced state of the 
art in oncology, the skilled artisan would have found such testing routine and not 'undue'. 

Therefore, Appellants respectfully request reconsideration and reversal of this 
outstanding rejections under 35 U.S.C. §101 and §112, first paragraph, to Claims 1 19-123. 



For the reasons given above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed polypeptide and its antibodies. As such, 
Appellants respectfully request reconsideration and reversal of the outstanding rejection of 
Claims 119-123. 

The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 08-1641 (referencing 
Attorney's Docket No. 39780-2730 P1C12) . 



HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 



CONCLUSION 



Respectfully submitted, 



Date: July 12, 2007 
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VIII. CLAIMS APPENDIX 

Claims on Appeal 

119. An antibody that specifically binds to the polypeptide of SEQ ID NO: 1 94. 

120. The antibody of Claim 119 which is a monoclonal antibody. 

121. The antibody of Claim 119 which is a humanized antibody. 

122. The antibody of Claim 119 which is an antibody fragment. 

1 23 . The antibody of Claim 1 1 9 which is labeled. 
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X. RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 

above. 
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DECLARATION OF AUDREY D. GODDARD, Ph.D UNDER 37 CF.R. § L132 

Assistant Commissioner of Patents 
Washington, D.C. 20231 



I, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2 . Between 1993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3 . My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 



Sir: 



Serial No,: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (Le., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et aL t Biotechnology 10:413-417 (1992) (Exhibit B); Livak et aL, PCR 
Methods Add!.. 4:357-362 (1995) (Exhibit C) and Heid et aL, Genome Res. 6:986-994 (1996) • 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polyraerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et aL, Proc. 
Natl, Acad. Sci. USA 95(25):14717-14722 (1998) (Exhibit E); Pitti et aL, Nature 
396(6712);699-703 (1998) (Exhibit F) and Bieche et aL, Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 



Serial No.: * 
Filed: * 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 



Date 



Audrey D. Goddard, PhD. 



AUDREY D. GODDARD, Ph.D. 



110 Congo St. 

San Francisco, CA, 94131 

415.841.9154 

415.819.2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Gerientech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998-2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1998 



Scientist 



Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89-12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocytic leukemia 
translocation breakpoints on chromosomes 17 and 15. 

. • Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 



"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 



1989 



Honours B.Sc 

"The in vitro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1983 
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ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 

Medical Research Council Studentship 1983-1988 

NSERC Undergraduate Summer Research Award 1983 

Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 

Dr. Harry Lyman Hooker Scholarship 1981-1983 

J.LW. Gill Scholarship 1981-1982 

Business and Professional Women's Club Scholarship 1980-1981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy^ October 1991 



PATENTS 

Goddard A, Godowski PJ, Gurney AL. NL2 Tie ligand homotogue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D f Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25,2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues! Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 
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Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C t Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH t Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S., Lewin DA. (2001) BFIT f a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J. Ho WH. Maruoka M. Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22 f a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954! 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, AL. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 



Audrey D. Goddard, Ph.D page 6 of 9 



Yan M, Lee J t Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood.WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficienUroute to human bispecific IgG. Nature Biotechnology 16(7): 677-681 . 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2UTRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor G/M . Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A, Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 81 8-821, 
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Goddard AD, Dowd P, Chernausek S, Geffner M f Gertner J t Hintz R, Hopwood N t Kaplan S, 
Piotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL t Moffat B, Vandlen R, Simmons L, Gu Q t 
Hongo JA, Devaux B, Poulsen K t Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE ( Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL t Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. ScL USA 93: 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D, Henzel W and Bass S. ( 1996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp. 187-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-71 13. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for ah Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-reiated receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. ScL USA 92: 1 866-1 870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6: 732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T. Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group, (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS t Moran P, Gripp J f Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. NeuroscL Res. 38: 705-715. 

Mark MR t Scadden DT, Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 1 0720-1 0728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocyte leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl3: 117S-119S. 

Zhu X, Dunn JM f Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell. Genet. 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254:1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet Cell. Genet. 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1580. 
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Myers JC, Jones TA, Pohjolainen E-R, Kadri AS, Goddard AD, Sheer D, Solomon E and 
Pihlajaniemi T. (1990) Molecular cloning of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
Genet 46: 1024-1033. 

Gallie BL, Squire JA, Goddard A, Dunn JM, Canton M, Hinton D, Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest. 62: 394-408. 

Goddard AD, Phillips RA, Greger V, Passarge E, Hopping W, Gallie BL and Horsthemke B. 
(1990) Use of the RB1 cDNA as a diagnostic probe in retinoblastoma families. Clinical 
Genetics 37: 117-126. 

Zhu XP, Dunn JM, Phillips RA, Goddard AD, Paton KE, Becker A'and Gallie BL (1989) 
Ggrmline, but not somatic, mutations of the RB1 gene preferentially involve the paternal 
allele. Nature 340: .312-314. 

Gallie BL, Dunn JM, Goddard A, Becker A and Phillips RA. (1988) Identification of mutations 
in the putative retinoblastoma gene. In Molecular Biology of The Eve: Genes, Vision and 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 88. 
J. Piatigorsky, T. Shinohara and P.S. Zelenka, Eds. Alan R. Liss, Inc., New York, 1988, pp. 
427-436. 

Goddard AD, Balakier H, Canton M, Dunn J, Squire J, Reyes E, Becker A, Phillips RA and . 
Gallie BL. (1988) Infrequent genomic rearrangement and normal expression of the putative 
RB1 gene in retinoblastoma tumors. Mot. Cell. Biol. 8: 2082-2088. 

Squire J, Dunn J, Goddard A, Hoffman T, Musarella M, Willard HF, Becker AJ, Gallie BL and 
Phillips RA. (1986) Cloning of the esterase D gene: A polymorphic gene probe closely linked 
to the retinoblastoma locus on chromosome 13. Proc. Natl. Acad. Sci. USA 83: 6573-6577. 

Squire J, Goddard AD, Canton M, Becker A, Phillips RA and Gallie BL (1986) Tumour 
induction by the retinoblastoma mutation is independent of N-myc expression. Nature 322: 
555-557. 

Goddard AD, Heddle JA, Gallie BL and Phillips RA. (1985) Radiation sensitivity of fibroblasts 
of bilateral retinoblastoma patients as determined by micronucleus induction in vitro. Mutation 
Research 152: 31-38. 
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Russell Higuchi*, Gavhx Bollinger 1 , P- Sean Walsh and Robert Griffith 

Roche Molecular System*. Inc., 1400 55rd St., Emeryville CA 94008. 'CWron Corporation, 1400 *3rd Su Emeryville, CA 
94608, ♦Corresponding; author. 



Wc have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress, tile ability to simulta- 
neously amplify specific DNA setjuences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 

Although the potential benefit* of PGR 1 to. clin- 
ical diagnostics arc weft^ known** 5 , it is still not 
widely used in this setting, even though it is 
font- year* ciuco thcrmostabl* I>NA poty™**-* 

ase* 4 made PGR practical Some of the reasons for it* slow, 
acceptance are high oust, tack of automation of pre-? and 
post-PCR processing steps, and false positive results, from 
carryover-contamination. The first two point* are related 
in that labor is the largest contributor to cost flit the present 
stage of PGR development. Most current assays require 
some form of "downstream" processing once thermocy- 
eHng ts done in order -to determine whether the target 
DNA sequence was- present and has amplified. These 
include DNA hybridation* 6 , gel electrophoresis with or 
without use of restriction digestion*;*/ HPLCr, or capillary 
electrophoresis 10 . These methods are labor-intense, have, 
low throughput, and are difficult to automate. The third 
point is also ck^cty related to downstream processing. 
The handling of the FCR product in these downstream 
processes increases the chances that aropfiBed DNA 'will 
spread through the typing* lab, resulting in * a .risk of 



"carryover'* false positives in subsequent testing 1 , 

These downstream processing steps would be elimi- 
nated tf specific amplification and detection of amplified 
DNA took place simultaneously vithin an unopened re- 
action vessel Assays in whkh such different processes take 
place without, the need to separate reaction components 
have been termed ^mogeneous' 1 - No truly hbmogc-. ] 
neons PGR assay has been demonstrated to date, akhough 
progress towards this end has been reported. Chehab, et 
aL l * developed a FCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product Allctc^pecific primers, each with different fiuo- 1 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporaied primers tnust still be 
removed in a downstream process in order to visualize the 
result Recently, HoBand, et al, 13 , developed an assay in 
which the endogenous 5' exooudease assay of Taf DNA 
polymerase was exploited to deave a labeled cflgonucJeo- 
cide probe. Hie probe would only dcave if FCR arap£h- 
earjon had produced its complementary sequence. In 
order to detect the dcavage product?, however, a subse- 
quent process is again needed. 

We have developed a truly homogeneous assay for PGR 
and PCR product detection based upon the gready in- 
creased fluorescence that ethidiura btoinkle and other j 
DNA binding dyes exhibit when they are bound tojb- 
DNA 1 ^ 16 . As outlined in Figure I, a prototypic PGR 
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1 Principle of simultaneous ampGficotkm and deletion of 



PCR produce The coitipbacntsof a PCR COGttrtnmg Ei3r that mi 
Ouoresceni are listed— EtBr itselt EtBr bound tp other ssDNA or 
daDN A Tbete is a large auorcsccrtcc enhancement when EtBr |s 
bound to DNA and buniing is greatly enhanced when DNA is 
doublc-urandccU After sutudcru <n)..cydex of PGR. the net 
increase xn d-ipNA vesidts in addkiooai EtBr butdSn^, and H net 
increase in total tiurtracence: 
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2 Gd electrophoresis of FGlt ^mplificatzon products of the 
human, txudcar gene, HLA made in the presence of 

increasing amounts of EtBr (up to 8 f^g/tnl). The presence of 
£tBr has no obvious effect on the yield or specificity of amplifi- 
cation. 
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RGCnff S (A) Ftttotesccnce measurement* from PCRs that contain 
Hgftnl EtBr and that ate specific for Y^romosourc repeat 
sequences. Five replicate PCRs were begun containing cadi of the 
DNA* specified. At each hujicacd cyde, one of the five rep&eate 
PCRs for cadb DNA -was removed from tHcrmocydmg and Hs 
fluorescence measured. Unit* of fluorescence are arbftrary. (B) 
UV photography of PGR tubes (0\5 ml Eppcndorforylc, poiypro- 
pylcne microcentrifuge tubes) containing reactions, those *Ca.ru 
ing from 2 ng male ON A and control reactions Without any DNA, 
from (A). 



begins with primers that are smgle-strandcd DNA (ss- 
DNA), dNTPs, and DNA polymerase; An amount of 
dsDNA ccntairimg the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micro crams per PCR* 8 , If EtBr is present, the reagent* 
that will fluoresce, in order of UKrcaslng fluorescence, are 
fro: EtBr hsclft and EtBr bound to tie singles tranded 
DNA primers and to the doubles tranded target DNA (by 
its mtercalation between the stacked bases of the DNA 
dovbk^hcfSx). After the first denatu ration cycle, target 
DNA will be largely $io#c-*tranded. After a PCR is 
completed, die most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several mkrd£rarns- Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much less than to dsDNA, the effect of this change on 
the total fluomccricc of the sample is small* The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thermocy- 
cHng. 

RESULTS 

PCR in the presence of EtBr. In order to' assets the 
affect of EtBr in PGR, amplifications of Ihe human Hl«4 
DQa gene* 9 were performed with the dye present at 
concentrations from 0,06 to 8.0 ufj/ml (a typical concen- 
tration of EtBr used tn staining of nucleic aods fojEtowing 
get electrophoresis is 0.5 u^g/mf). As shown in Figure 2> gel 
electrophoresis revealed little or no difference in the yield 
or quality of the amparkadon product whether EtBr was 
absent or present at any of these coticenti^ttotis, indicat- 
ing that EtBr does not inhibit PCR, 

De tection 



of human Y^dttomosonM specific se- 
quences* Sequen<%^peoric, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 u.g/mi EtBr and primer* 

?>edftc to repeat DNA sequences found on the human 
-chromosome* 0 - These PCRs initially contained either 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA« After 0» 
17, Jfl, 24 and 29 cycles of thermocyding, a PCR for each 
DNA was removed from the therraoeyder, and its fluo- 
rescence measured in a spectroftaorometer and plotted 
vs. amplification cyde number (Fig. 3 A), The shape of this 
curve Tcficcts the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exr^nenual with cycle number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for die PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PGR*, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycles were needed t& give a detectable increase in fluo- 
rescence. Gel eJectrwnoresi* on the products of these 
amplifications showea that DNA fragments of the ex- 
pected she were made in the male DNA containing 
reactions and that Hide DN A synthesis took place in the 
control samples. 

In addition, the increase in fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
trandUumuiator and photographing them through a red' 
filter. This is shown in figure 3B ibr the reactions thai 
began whh 2 ng male DNA and those with no DNA. 

Detection of specific alkie* of the human fl-globin 
gene In order to demonstrate that this approach has 
adequate specificity to allow genetic screening, a decoction 
of the $ickle-cdl anemia mutation was performed- Figure 
4 shows the fluorescence from oompleted amplication* 

containing ttBr (0.5 i^gfoiT) as detect** by photography 

of the reaction tubes on a UV txan^iUnmjnator, These 
reactions were performed using- primer* specific for ci- 
ther the wild-type or sickle-cell mutation of the human 
p-globin gene". The specificity for each allele is imparted 
by placing the sickle-mutation site at the terminal 3' 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer e%te^on-~and thus an> 
pUftcation— «can take place only if the 3' nucleotide of the 
primer is complementary to the g-gtobin allele present* 1 * 
Each pair 0? amplications shown in Figure 4 consists of 
a reaction with either the wildHype allele specific (left 
tube) or stckk-allele specific (right tube) primers. Three 
different DN As were typed: DNA from a Homozygous, 
wHd-type p~globin individual (AA); from a heterozygous 
sickle p-g*obin individual (AS); and from a homozygous 
sickle p-gio&u indrridual (SS). Each DNA (50 ng genornK 
DNA to start each PGR) was analyzed in triplicate (3 pairs 
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c f reactions each)* The DMA .type vas reflected in the ; 
fffa&ve fluorescence intensities in each pair of completed 
Am p]t£catk>ft£. There was a significant increase in fluores- 
(croc only where a ^globin allele DNA matched the 
primer set. When measured on a spcttroflnoromctcr 
fo^ta not shown), this fluorescence was about three times 
jttt present in a PGR where both 0-globin alkies were 
Snatched to the primer set* Gel electrophoresis (not 
phown) established that this increase in fluorescence was 
<j«e to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for p-globin. There was 
frdc synthesis of dsDNA in reactions m. which the allele- 
spedfic primer was mismatched to both alleles. 

Continuous nwoitoribog of a PCR. Using a fiber optic 
devkerit is possible to direct excitation illumination from 
,-i spectrofluorometer to a PGR undergoing thcrmocydiog 
a nd to return its fluorescence to the ispectroftuoromcter. 
The fluorescence readout of such an arrangement, di- 
rected at an Etfir-concaining amplification of Y-chromo- 
pome specific sequences from 25 ng of Whwp mate DNA« 
is shown in Figure 5. The readout from a control PCR' 
nfch no target DNA b also shown. Thirty cycles of PCR 
were monitored for each. 

The fluorescence trace as a function of time clearly 
shows the effect of the thermocydiiig. Fluorescence inten- 
sity rises and bib inversely with temperature* The fluo- 
rescence intensity is minimum at the denaturation tem- 
perature (94°C) and maximum at the annealin ^extension 
temperature (50°C). In the negative-control rCR, these 
fluorescence maxima and minima do not change. signj&- 
canUy over the thirty tbexmocyck^ indicating that there is 
little dsDNA synthesis without the appropriate target 
DNA, and there is little if any bleaefung of EtBr during 
the continuous illumination of the sample. 

Jn the PCR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds' of therroocyding, and 
continue to increase wHb time, Indicating that dsDNA is 
being produced at a detectable level Note that die fluo- 
rescence minima at the denaturatfoa temperature do not 
nignificanlly increase, presumably because at this temper- 
ature there is no dsDNA for EtBr to bind- Thus the course 
of the amplification is followed by tracking the Auorcs-. 
cence increase at the auncaHng; temperature. Analysis of 
the products of these two amplifications by gel elcctropho- 
txfih showed a DNA fragment of the expected size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quenee-«pedflc probe can enhance die specificity of DNA 
deceuuVHi Vr> PGR, The cHmioatkm of iheac processes, 
means that* the specificity of this homogeneous assay 
depends solely on that of FCR. la the case of skkle-celi 
disease, we have shown that PGR alone has sufficient DNA 
sequence speeificky to permit genctk screening. Using 
appropriate amplification conditions, there b Bttlc non- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The spedficky required to detect pathogens can be 
more or less than that required* to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with die 
sample. A difficult target is HIV, which requires detection 
of a viraJ genome that can be at the level of a few copies 
per thousands of host cells 5 . Compared with generic 
•screening, whkh is performed on cells containing at least 
one copy of the target sequence* HIV [detection requires 
both more specificity and the input of more fcrtal 
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HWMMUV photography of 1^ tubes cOTtaininjai 
using EtBr that are specific to viloVtypc (A) or lidkk attdc* of 
the hitman (V-gfobin gene. The left of each pair tubes centains 
aHele-ftpcdHe primers to the wild-type ancles, the right tube 
printers to the skWe attek- The photeffiraoh taken after 30 
cycles of PCR, a&d the input DNAs and the alleles jhey cootam 
are indicated- FSfty ng of DNA was used to begm PGR. Typing 
was done in tripticaCc (3 pair* of PCRs) lor each mpu% DNA: 
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20ng of male DNA 
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RCtRES Continuous, real-time monitoring of a PCR. A fiber optic 
was vised to carry excitation Kgbt to a rUR to progress and 3tl*o 
emitted light back to a Buoromctcr (see Ex^ntncntal JrVotocol). 
AmpHficaUca-tiStOg-huinan malo-PNA specific primer* in a PCR 
Starting with tO rtg of human male £>NA {tofh or jn a coatrbt 
PCR without PNA (bottom), were monitored. Thirty cycles of 
PCR were followed for each. The* temperature cyded between 
94*C {denaturadon) and 50*0 (annealing and extension). Note in 
the male DNA PCR, . the cycle (time) dependent increase in 
fluorescence at the antieaEn^ejrteiJ«on temperature. 
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DNA-— up to microgram amounts — in order to have suf- 
ficient numbers of target sequences. This large amount of 
starling DNA Ht an amplication sigattteautly increases 
the background fluorescence over which amy additional 
fluorescence produced by PCR must be detected; An 
additional complication, that occurs with targets in low 
copy-number is the formation of the "primer-dimer" 
artifact. This vs the result of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PCR amplification, and can compete whh 
true PCR targets if those targets are rare. The primfer- 
dimcr product U of course dsDNA and thus is a potential 
source of false signal in this homogeneous assay. 

To increase PCR specificity and reduce the effect of 
primer-dimcr airtpUrVcatioa* we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplifications that take place tn a single tube 8 , and the 
*hot-start\ in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins". Preliminary results using these ap- 
proaches suggest tbacpruner-dirocT is effectively reduced 
and it is possible to detect the increase in Etfir fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10* celts. With larger numbers of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To. reduce this background, it may 
be possible to use sequence-specific DNA4>inding dyes 
that can be made to Deferentially bind PCR product over 
genomic DNA by uKOrporaung the dye-binding DNA 
sequence into the PCR product through a S* *add-on" to 
the oiigonwdeodde primer* 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBrnxintaming PCR is straightforward, 
both once PCR is completed and continuously during 
Uiermocyding. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The fluorescence analysis 
of completed PCRs is alrc^yjpossiblc with existing instru- 
mentation in 96-well format**. In tliis formal; the fluores- 
cence in each PCR can be cjuantitated before, after, and 
even at selected points during therraocycung by moving 
the rack of PCRs to a 96-microwcM plate fluorescence 
reader 40 , \ 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle, 
A direct c?c tension of the apparatus used here is to have 
multiple fiberopdes transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target UNA copy number. Figure 3 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase is detected* Prdirai- 
nary experiments {Higuchi and DolUnger, manuscript in 
preparation) with continuous morutoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
fcnown— a* it can be in genetic scTeeaiing-^contuiuons 
monitoring may provide a means pf detecting false posi- 
tive and false negative results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR* 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts* False negative 
resides due to, for example, inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
ineffjeiendy ampfcfying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles—many more* than are necessary to detect a true 



positive- If a sample fails to have a fluorescence increase 
after this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event oefore any test based on thk 
principle is ready for the clinic, ah assessment of te false 
positive/false negative rates wfli need to be obtained using 
a large number of known samples. 

In summary , the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplificarion from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR. 
m applications that demand the High throughput of 
samples* 

EXPERIMENTAL PROTOCOL 

Hciman HLA-DQct gen* AmpliBcadons containing 
PCRs were set up in 100 |U volnmes containing 10 mM Tris-*HC1 % 
pH 8.S; 50 mM KCl; 4 nM MgC^: S-S units of To* DNA 
pedywerwe (P***m*E}mcr Ceuu, Norwalk, CT); 20 piriolc each 
of human HlA-DQa gcnc specific cOigonuclcodcie primers 
(Ht$6 and CHS7 19 and approaomately 10* copies of DQft PCR 
product diluted from a previous reaction. Ethidium bromide 
fe&r; Stgtn.\> used at the ccmceatrauonii indicated id Figure 
2. Thcraocyding proceeded for 20 evdes in a model 480 
uScnnocydcr <*ertin-Eljner Ccw, Norwallt, CT) ««nga"ftcp- 
cyclc" program of 94*C for I miu- dcnaturatKwi and (XrC forsO 
sec annCaBng and 72*C for SO sec estenston. 

Y-chromowmc specific PCR* PCRs (100 ul total reaction 
volume) contahiing^ ttfi/«" EtBr were prepared as desenbed 
For HLA-DQor, except with diflercnt primers and target DNAs. 
These PCRs contained 1 5 prnotc each male DNA-tpcctfic pnmen 
YI.l and Yl.2*\ atid cither 60 ng male, €0 Ug female, 2 ng mak, 
or no human DNA. Thcrmocycling *^s94 6 C?or 1 nun ; and 60fC 
for 1 min using a "stcp<yde* program. The number ©t cycles tor 
a sample were as indicated in *lg»«X 3. Fluorescence measure- 
ment is described bcJow. - % 

Alick specific, human p-gtobift ejw* PGR* Amplificauons of 
100 fd volume' using 0 5 |tg/ml of &Br were prejwrcd aj 
described for HLA^DQci above except with different primer* and 
target DNAs. These PCRs cOUUdntd *u>er primer pair HOP?/ 
H814A <wiW-type g»obi« specmc prifnens) or HCP*VH|H4$ (fjek- 
le-riobin specific primers) at 10 pmole e*ch pnrocr per PCR™ 
These winters were developed by Wu ct aL 21 . Three different 
Ucgei DN Ait *ere tucd in separate amplication*-- 50 ng each of 
human DNA that was homozygous for the *kYk trah (SS), DNA 
that was rietcroryrow for the sickle Irak (A$X or DNA that 
homozygous for the W.t- globin (AA). Thcrmwycnng .wtt W » 
cycles at *TC for i mm. and $5<C Cor 1 min* itsbf a H stcp-cyck 
1 * — - -rature of 55^ hcen .shown try 



,/oet aL 21 to provide allcTc^pocirk atPpIif&tion. OwnpJctcd 
PCIU were photographed through a red ^W**™^ 3 *) 
after pladng the rcacbon lubes a«jp amoddTM^6 tranafflutni- 
nator (U V-producti San- Gabriel, CA). 

Fhiotese^Kesneasnraneiit. R^tescence racasuranenw wet* 
tna<U on PCRs containing EtBr m a Fluorolog-2 flUorwnCtcr 
(SPE5C* Edison. Nl). Excitation was at the &00 nm band wan 

Crist Inc^ Irvine. CA) to exclude scccmd-ordcr light >w™f* 
fight was detected at 570 nm with a bandwidth of about 7 nm, An 
OG 530 pm cut-oft" filter was used to remove thcraatjuoo ngftt 
ConatHtoa* n^oreflccnee monitormg of FCR, Connnuous 
monitoring of a PCR in progress was accomplished using a* 
Bpcctrofluoronietcr and setdnga descrcboa above as 
tfeoptk accessory (SPEX at no. 1950) to both send exotauon 
Rght to, and receive emitted light from, a PCR pW m awcRol 
a modd *00 mcrtnocyder (Ftrk'm-Elmer Cetu*). The probe e«d 
of the fiberoptic cable was attached with **$ mrn.utc-cpox>' to Oft 
open top of a PCR tube (a 0.5 ml polypropylene centrifuge WW 
wtth Hs cap removed) efTeetrveW fleeing u* Tlrccxpo^ W ^ 
me PCR tube and the end of the nberopuc caWc were ^jeldcd 
from room light and the rooco lights were kept dmuned dvrmg 
each run- The rnonitorcd PCR was an awplxhcludn of Y*<* ^ 
tiKwn^pednc repeat seoveucet as dc^cr^ed above. cxceF 
usnjff an anncahngtetenskm tenrperauirc of 50X. The reacocra 
was covered wkh roii*eral oil (2 drops) to prevent crvaporanon. 
TTicrn^cW and Buorrsccncc roc^ccment were 5tartedn- 
multancously, A thnc-basc scan with a 10 second mtegranou tone 
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uM*d And the «mbsioa signal was ratioed to tbc cxctotioo 
n jg(VtJ to control foe change* in light-source intensity. i>ata were 
Reeled "sing the dm5000f t verwon 15 (SPEX) data system. 

Wc «tarik Bob Jonc* for help with the sptctrofluormctric 
UKywiremcnU *nd HcalhcrbeM Fony for editing this manuscript. 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J, Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

Perkin-Elmer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 

I quencher dye attached. An Increase 
|h reporter fluorescence intensity In- 
Bicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' nucle- 
olytic activity of Taq DNA polymerase. 
In this study, probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3'~end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence. In general, probes with 
the quencher dye attached to the 3 - 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 -end nucleotide also exhibited 
an increase In reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
Bf attached at opposite ends can 
used as homogeneous hybridiza- 
tion probes* 



homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fiuoro- 
genic probe was described by Lee et al. (0 
The assay exploits the 5' -» 3' nucle- 
olytic activity of Taq DNA poly- 
merase (2 ' 3) and is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET).< 4 ' 5 > During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' 3' nucleolyric 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an Increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (€) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 



MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphorarnidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphorarnidite, 6-carboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocklng phosphate, 
were obtained from Perkin-Eimer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-iabeied phos- 
phorarnidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of 5' nuclease assay. Stepwise representation of the 5' - 3' nucleolyric ac- 
tivity of Taq DNA polymerase acting on a Autogenic probe during one extension phase of PCR. 



rriM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x4.6- 
mm column with particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5* end. 



PCR Systems 

All PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-jtl reactions that con- 
tained 10 mM Tris-HCl (pH 8.3), 50 mM 
KC1, 200 jjum dATP, 200 \lu dCTP, 200 jjlm 
dGTP, 400 \>m dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295;bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajima-U|ima et al.) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. (B) Actin am- 
plification reactions contained 4 mM 
MgCl 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 hm each 



TABLE 1 Sequences of Oligonucleotides 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
piasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUCl 19. 
These reactions contained 3.5 mM 
MgClz, 1 ng of piasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (I min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-^1 
aliquot of a sample was transferred to an 
individual well of a white, 96-well rnlcro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a SIS-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
Is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



Name 



Type 



Sequence 



FU9 

R119 

P2 

P2C 

i»5 

P5C 

AFP 

ARP 

Al 

A1C 

A3 

A3C 



primer 
primer 
probe 

complement 
probe 

complement 
primer 
primer 
probe 

complement 
probe 

complement 



ACCCACAG G AACTGATCACCACT C 

ATGTCGCGTTCCGGCTGACGTTCTGC 

TCGCATTAClGATCGTrGCCAACCAGTp 

GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITFGCTGGTATCTATGACAAGGATp 

TTCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 

CAGCGGAACCGCrCATTGCCAATGG 

ATGCCCTCCCCCJ\TGCCATCCTGCGTp 

AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCCrGGACTTCGAGCAAGAGATp 
CCATCfCTTGCTCGAAGTCCAGGGCGAC 



For each oligonucleotide used in this study, the nucleic add sequence is given, written in the 
5' 3' direction. There are three types of oligonucleotides: PCR primer, fluorogenlc probe usea 
in the 5' nuclease assay, and complement used to hybridize to the corresponding probe. For me 
probes, the underlined base Indicates a position where IAN with TAMRA attached was suDsn- 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A 1 -2 RAQGCCCTCCCCCATGCCATCCTGCGTp 

A1 -7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1 -1 4 RatcccctcccccaQgccatcctgcgtp 

A1 -1 9 RatgccctcccccatgccaQcctgcgtp 

A 1-22 RATGCCCTCCCCCATGCCATCCQGCGTp 

A1 -26 RATGCCCTCCCCCATGCCATCCTGCGQp 



Probe 


518 nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ tomp. 








A1-2 


25.5 ±2.1 


32.711.9 


38.2 ± 3.0 


38.2 ±2.0 


0-67 ±0.01 


0.86 ±0.06 


0.19 + 0.06 


A1-7 


53.5 ±4.3 


395.1 ±21.4 


108.5 ± 6 3 


11 0.3 ±5.3 


0.49 ±0.03 


3.58±0.17 


3.09 ±0.18 


A1-14 


127.0±4.9 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6,3 


1.16 ±0.02 


4.34 ±0.15 


3.18 ±0.15 


A1-19 


187.5 ± 17.9 


422.7 ±7.7 


70.3 ±7.4 


73.0 ±2.8 


2.67 ±0.05 


5^0 ±0.15 


3.13 + 0.16 


A1-22 


224.6 + 9.4 


482.2 ± 43.6 


100.0 ±4.0 


96.2 ±9.6 


2.25 ±0.03 


5.02±0.11 


2.77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 ±18.4 


93.1 ±5.4 


90.7 ±3.2 


1.72 ±0.02 


5.01 ±0.08 


a29±0.08 



FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.D. for six reactions run without added template (no temp.) 
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 

eaction tube. This normalizes for well- 

^Wo-well variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
<RQ + ). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher we^re tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the 5 r nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ ~ depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes AM4, Al-19, Al-22, and 
Al-26 probably have reduced quenching 
as compared with A 1-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficientfy the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ + 


ARQ 


A3-6 


54.6 ± 3.2 


84.8 ± 3.7 


116.2 ±6.4 


115.6 + 2.5 


0.47 ± 0.02 


0.73 ± 0.03 


0.26 ± 0.04 


A3-24 


72.1 ± 2.9 


236.5 ± 11.1 


84,2 ± 4.0 


90.2 ±3.8 


0.86 ± 0.02 


2.62 ± 0.05 


1.76 ± 0.05 


P2-7 


82.8 ± 4.4 


384.0 ± 34.1 


105.1 ± 6.4 


120.4 ± 10.2 


0.79 ± 0.02 


3.19±0.16 


2.40 ± 0.16 




113.4 ±6.6 


555.4 + 14.1 


140.7 ± 8.5 


118.7 ± 4.8 


0.81 ± 0.01 


4.68 ± 0.10 


3.88 ± 0.10 


™io 


77.5 ± 6.5 


244.4 ± 15.9 


86.7 ± 4.3 


95.8 ± 6.7 


0.89 ± 0.05 


2.55 + 0.06 


1.66 ±0.08 


PS-28 


64.0 ± 5.2 


333.6+ 12.1 


100.6 * 6.1 


94.7 ± 6.3 


0.63 ± 0.02 


3.53 ± 0.12 


2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Materia! and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end ; there is a marked Mg 2 * effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 " 1 " concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 mM Mg 24 " is only slightly higher than 
RQ at 10 mM Mg 2+ . For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2+ are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 **" followed by 
a gradual decline as the Mg 2f " concen- 
tration increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2 " 1 " with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2+ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 2 * effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION . 

The striking finding of this study is that 
it seems the rhodamlne dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This Implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 run 



582 nm 



RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.4S 


0.50 


Al-26 


43.31 


509.38 


53.50 


93.86 


0.81 


5.43 


A3-6 


16.75 


62.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.25 


0.45 


3.21 


P2-7 


35.02 


70.13 


54.63 


121.09 


0.64 


0.58 


P2-27 


39.89 


320.47 


65.10 


61.13 


0.61 


5.25 


P5-10 


27.34 


144.85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded, The fluorescence emissions at 518 or 582 nm for solutions containing a final 
concentration of SO nw indicated probe, 10 mM Tris-HCl (pH 83), 50 mM KC1, and 10 mM MgCl 2 , 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 nM 
PSC for probes P5-10 and PS-28. Before the addition of MgCl* 120 ul of each sample was heated 
at 9S°C for 5 min. Following the addition of 80 of 25 mM MgCl 2 , each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end f or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm. If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ~ include 
the particular reporter and quencher 
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mM Mg 

FIGURE 3 Effect of Mg 2 * concentration on RQ ratio for the Al series of probes. The fluorescence 
emission Intensity at 518 and S82 am was measured for solutions containing 50 nM probe, 10 mM 
Trls-HCl <pH 8.3), 50 mM KC1, and varying amounts (0-10 mM) of MgC! 2 , The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 

•context effects, presence of structure or 
kther factors that reduce flexibility of 
uie oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T ral presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor Is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe/ 0 

The rise In RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching Is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
an internal position. In effect, a 
^^^encher at the 3' end is freer to adopt 
^informations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ~ values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ~ 
value. For the PS probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ". 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay, in this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et aL u> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 

PCR Methods and Applications 36 f 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes, The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. <10) describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two Imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR amplifi- 
. cation. 
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Wel^ve developed a novel "real time" quantitative PCR method. The method m««ir« PCR prodnn 
^?2d7th£H«h a duMM nuowfnlc probe Tartan ^g^^iSJ 
accurate and reproducible quantitation of *enc copies. Unlike otter quantitative PCR methods, rcaWime.n-K 
I? nc^ handUnJ preventing potential PGR product carry-over contamination and 

esultin, iTmuch faster and higher throughput assays. The -Mime PGR .Hf i JJ^^^SS 

ranee of carting tar B et molecule determination {at b»i five orders o(-ra«nitUdc). ReaWme quantitative 
PCR is extremely accurate and tessMabor-in tensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence analysis Jias 
had an important role in many fields of biologi- 
cal research. Measurement of gene expression 
<RNA) has b«e.n used extensively In monitoring 
biological responses lo various stimuli (Tan et al. 
1991; Huanft el al. I995a,b; Prud'hommc et. al. 
1995), Quantitative gen*? analysis (PNA) has 
.iH-en used to determine the genome quantity of ;* 
particular gene, as in the case, ox the human HEK2 
gene, which Is amplified in -30% of breast tu- 
mors (Slarnon v.i al. 1987). Gene and genome 
quantitation (DNA and UNA) also have been used 
fur analysis of human immunodeficiency virus 
(HlV) burden demonstrating changes in the lev- 
els of virus throughout the different phasic of the 
disease (Connor et al. 1993; Platak el al, jvv.sh; 
Purtado et al. 1995). 

Many methods have been described for the 
quantitative analysis ot nucleic acid sequences 
(both for UNA and DNA; Southern VJ /6; Sharp et 
al. 1980; Thomas 1980)." Recently, PCR lias 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase -(RTJ-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made pos- 
sible many experiments lhat could ziot hove been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 



6T0lg] 



that it be umsU properly for quantitation (U»«y- 
maskers 1995). Many early reports of quantita- 
tive PCR and RT-PCR described quantitation of 
the PCR product but did not measure the initial 
target sequence quantity. It is essentia] to design, 
proper controls for the quantitation of the initial . 
target sequences (Pcrrc 1992; Clement I el al. 
1003) 

Kv.s*ftrchcis hav*. developed several methods 
of quantitative PCR and RT-PCR, One approach 
measures PCR product quantity in the log phase 
of the ri'Htflon before the plateau (Kellogg et al. 
1990; Pang ct ah 1990). This method requires 
thai each sample has equal input amounts of 
nucleic- add and that each sample under analysis 
amplifies with identical efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained hi «H samples at relatively constant quan- 
tity such as p-aciln) can be used for sample, 
amplification efficiency normalization. Using 
conventional methods of PCR. detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure, 
that all samples are. analyzed during the log phase 
of the reaction (for l>oth the target gene and Ihc 
normalization gene). Another method, quantita- 
tive competitive (QC)«PCR, has been developed 
and Is used widely for PCR quantitation. QC-FCU 
rnlics on the inclusion of an internal control 
competitor in each reaction (Beckcr-Andrc 1991; 
PiataK ct al. 1991W>), The ^ucicmcy of each re 
action is normalized to the internal competitor. 
a wnnwn auuiunt of internal competitor can be 
30HX Z0S6 091 6*6 XVd 6S'.*T Z00Z/S0/ZT 
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added to each cample. To obtain relative n»anh 
ration, the unknown largct PGR product is com- 
pared with the known competitor t'CK product, 
.Success of a quantitative competitive i'CU assay 
relies on developing an internal control lliai am- 
j *1 i rik-rs with the same efficiency as the angel 
cculc. The design °' tnc coinpcthoj and the vali- 
dation of amplification efficiencies, jcquirc a 
dedicated effort. However, because QC-PCK does 
not require that P(-tt j nod nets be an jdyzud during 
the log phase of the. amplification, it is the. easier 
v »r tht: two methods to use. 

Several detection systems uie used for quan 
Utative 1*CR and UT-PcMt analysis; (1) agarose 
gels, (2) /luuresex-nt labeling of PGR products and 
detection with ln.itTr-imiucr.cl fluorescence using 
capillary electrophoTcsia (haseo et al. 1995; Wil- 
liams eT ah 1996) or acrylaiuldc gels, and (3) phtte 
capture and sandwich probe hybrid tan inn (Mul- 
der cs1 ah 1994). Although these method* puivrd 
successful, each method requires post-PCK ma- 
nipulations That add Tim*, to the analysis and 
may lead to hfuuraUny i niilmt'iination. The 
sample throughput of thc*e iiirtliuds i.s limited 
( wi|)i the i-xe.cpilon of the plate capture ap- 
proach). ,wnd, th«r«u-*Te., these methods are not 
well suited fin u.so demanding high sample 
throughput (I.e., screening of large numbers of 
hloiuwlcrt.ulc7» tii analyzing samples fui diagnos- 
tics or clinical trials). 

Here we report the development of a novel 
ussay for quantitative. DNA analysis. The assay is 
baiyi-td on the usr. of the .V nuclease assay first 
described by Holland et al. (1991). The method 
uses the, 5' nuclease activity of 7Vi</ polymerase lo 
cleave a noncxtcndlblc hybridist km probe dur- 
ing the extension pi \ oar of 1'OU. Tin: approach 
uses dual -labeled fiuorogenic hybridisation 
probes (Lcc. ct a3. 1993} Hauler ct al. 1993; Uvak 
ct al. 1996o,b). One fluorescent dye serves a 
reporter |PAM (i.e., Gearbox /fluorescein)! and hs 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (i.e., o-carboxy-i et ram ethyl- 
rhpdaminc). The nuclease degradation of the hy- 
bridi/i4tton probe releases the quenching of Ihe 
I 'AM fluorescent emission, resulting in an In- 
crease in peak fluorescent emission at wn. 
The use Of a sequence detector (Alii Prism) allows 
measurement of fluorescent spectra oi ail 90 wells 
of the thermal cycler continuously during the 
l'OK amplification. Therefore, the reaction* uje 
monitored in real lime. The output data is de- 
scribed and quantitative analysis of input target 
I )NA sequences is discussed below. 



RESULTS 



PCR Product Derecrlon in R«al Time 

The goal wax to develop a high-throughput, sen- 
xitivr, and n neural c gene quantitation assay for 
U5C I" monitoring lipid mediated therapeutic 
gene deli very. A plasinld unending human factor 
VIII gene sequence, pI-8TM (sec. Methods), was 
lifted as a model therapeutic w&w. The assay iisr* 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence in real 
time (Alii Prism 7700 Sequence Detector). '11 1« 
Taqman reaction requires a hybridization probe 
labeled with two different fluorescent dyes. One 
dye is a reporter dy* (1 ; AM), the other in x quench- 
ing dye (TAMRA). When the pruln: is inlacl, fluo- 
icaccnt energy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAMRA) . During Ihc extension 
phase of the PCR cycle, the. fluorescent hybrid- 
ization pfoi>c Is cleaved by the 5'-.'*' nttcicolylic 
activity of thr. DNA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
sultinK hi uii increase of the reporter dye fluOrOfc- 
cent emission ftp*Ctro. VCR primers and prubi*K 

were designed foi the human factor VJ II se- 
quence and human p-actln gene (as described in 
Methods). Optimization reactions were per- 
formed to choose the appropriute probe and 
magnesium concentration* yielding the highest 
Intensity of reporter fluorescent sign«l without 
sacrificing specificity. The Instrument uses a 
chaiwcouplcd device (i.e., CCD ctuncru) for 
measvtring the fluorescent emission spcetm from 
. r i(>0 to r»$0 nm. l-ach PC:U tube was monitored 
se^juentially for 2& msec with continuous moni- 
toring throughout the amplification. Each tube 
was rr-exjindncd every B.5 set*. Computer 'soft- 
ware, was designed t<* exinninr the flijorcjcent In- 
tensity of both the reporter dye (PAM) and 
the quenching dye (TAMRA). The fluorescent 
intensity of the quenching dye # TAMHA, changes 
very uttl« ovetr the course of the PCR ampHfb 
cation (data not shown). Therefore, the Intensity 
of TAM11A dye emission serves as an uiternal 
slandard with which to noi'tuuHxc the reporter 
dye (l : AM) emission variations. The software cal- 
culate* a value termed AKn (or ARQ) using the 
following equation: ARn - (Hn J ) (Rn"), where 
Kn 4 eml»Hlun intensity of rci^oricr/emission in- 
tensity of quencher al any given time In a rone 
tioti tube, and Ru " emission intensility of tc- 
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poncr/cmlsstoi) IhleiiMly of quencher measured 
prior to I'CK amplication in ihnr same reaction 
tube. \'or the purpose of quantitation, the Usi 
three data points (ARm) collected during the. ex> 
tension step for each KIR cycle were analyzed. 
The nucleolytic degradation of the. iiyoikhy^tion 
pfobe occurs during ihe "extension phase of rut, 
and, therefore, reporter fluorescein cuuhmum hi- 
creascs during this time. :jtik Ujjcc data point* 
were averaged for each KIk cycle and the mean 
value for each was plotted in an "ainpliilcation 
plot" shown In J'igurc 3 A. The AKn mean value is 
plowed on the j^axls, and time, represented hy 
cycle number/is plotted on the A-axis. During the 
early cycles erf the T'CU amplification, tin* AKn 



vaJue remains at base line When sufficient 'hy- 
bridisation probe has been cleaved by the Tiu) 
]X>iymerase nufloifiC activity, the intensity of re- 
porter fhiorcAccm emission iiuvivutfeb* Mosl \>CA\ 
amplifiudjom reach n plaieau phone of reporter 
fJuureHViH emission if the reaction is carried out 
to high cycle numheiN. The amplifirallnn plot lit 
examined euily in Urn reaction, at a point thai 
lepiesents Hu« log phase of piodwN arrmnula> 
tion. This is done by assigning an aihiljiuy 
threshold thai is based on the variability of the 
base-line data. In Hgure 1A, the threshold was set 
al 10 standard deviation* above, the mean of 
base lln<! emhmou talculated from tydc* 1 lo 1 5. 
Once the threshold is chosen, the point at which 
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Figure 1 PGR product detection in real time. {A) The Model 7700 software will con:Umct amplification plot* 
from the extension phase fluorescent emission data collected during the PCR amplification. Tlie standard de- 
viation is determined Irom the data points collected from Ihe base line of the amplification plot c, values are 
calculated by determining the point at which the fluorescence exceeds a threshold limit (usually 10 times the 
standard deviation of the base line). (B) Overlay ot amplification plots of serially (1:2) diluted human genomic 
DNA samples amplified with fi-actin primers. (6 Input DNA concentration of the samples plotted versus C T . All 
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the amplification plot crossed the threshold is civ 
fined as C r . C, is reported us Ihc cycle number a\ 
this point. As will be demons! rut od, I hi* C, value 
lit pi edict! ve of ihc quantity of input target. 

Ox Values Provide a Quantitative Measurement* o>' 
Input Target Sequences 

Pigurc IB shows amplification plot* of ]£**<]i¥fc»»'- 
ent PGR amplifications overlaid. *l*hc amplifou* 
tions were performed on a 1:2 serial dlhitloft -id 
human genomic DNA. The amplified targe.1 w:u 
human p actin. The amplification plohs shift to 
the right (to higher threshold cycles) n* the input 
target quantity is reduced. 'Jinr. is expected ho- 
cuum rtittCtlnriK with fewer starring cophtK of the 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of 10 stan- 
dard deviations above the base line was used to 
determine the C r values. Figure 1C represents the 
C r values plotted versus the sample dilution 
value. Each dilution was amplified, in triplicate 
VCM Amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The Cr values decrease linearly with increas- 
ing target quantity. Thus, Cr values can be used 
as « quantitative measurement of the input target 
number. It should be noted that the amplifica- 
tion plot for the 15.6- ng sample shown hi Mgurc 
IB does not reflect tht* same fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15.6-ng sample also achieves endpolM pla- 
teau at a lower fluorescent vaJuc than would be 
expected based on the input -I >N A. This pheiuwi* 
cnon has been observed occasionally with other 
samples (data not shown) and may be attribut- 
able to late cycle inhibition; this hypothesis is 
still under investigation.. It is important to note 
that the flattened slope and early plateau do not 
impact significantly the calculated O, value us 
demonstrated by the ftl on Die line shown in 
Figure. 1 C. All triplicate amplifications resulted in 
very similar d- values— ihc standard deviation 
did not exceed 0,5 for any dilution. This experi- 
ment contains a > 1 00,000-fold range Of Input tar- 
get molecules. Using C, values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. ol iluorcsccnl in- 
tensity measurement of the A1H l»rlsm 7700 Se- 

aora 



mt»tits over n very large r;oi$»<* /if r?*Ulivf» iiartlnp, 
target quantities. 

Sample Preparation Validation 

Several parameters influence the eflleleury nf 
l>c;R amplification: magnesium and sail conceit' 
tr at ions, reaction conditions (i.e., time and tem- 
perature), PCK target size and composition, 
primer sequences, and sample purity. All of The 
above factors are common to a single i'CR assay, 
except sample to sample purity, in an effort to 
validate the method of sample preparation for 
the lacior Vill assay, PCk amplification reproduc- 
ibility and olflciency oi 10 replicate sample 
pro| orations were examined. After genomic DNA 
was prepared from the 10 replicate samples, the 
DNA was quantified by ultraviolet spectroscopy, 
Amplifications were performed analyzing p-aciln 
mine, content in 100 and ZS n% of total genomic 
UNA. Each VCR amplification was performed in 
triplicate. Comparison of C r values for each trip, 
itcate sample show minimal variation based on 
standard deviation and coefficient of variance 
(Tabic 1). therefore, each ol the triplicate pcu 
amplifications was highly reproducible, demon- 
strating that real time VCR using this instrumen- 
tation introduces minimal variation Into the 
quantitative J'CU analysis. Comparison of the 
mean Gj values of the. 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for f-t-actln gene quantity. The highest C; T 
difference between any of rhe? samples was 0.fc>5 
and 0.71 for the 300 and 25 ng samples, respec- 
tively. Additionally, the amplification «f cadi 
sample exhibited an equivalent rale of fluores- 
cent emission intensity change per amount of 
DNA target analyzed as indicated by similar 
slopes derived from Ihc sample dilutions (Pig. 2). 
Any sample containing an excess of a 1'CK inhibi- 
tor would exhibit a greater measured 3-actin O r 
value for a given quantity of DNA. In addition, 
the inhibitor would be diluted along with I hit 
sample in the dilution analysis {Fig. 2), altering 
the expected C r value change. Each sample am- 
plification yielded a similar result in the analysis, 
dcmonslratinglbat this method of sample prepa- 
ration is highly reproducible with regard lo 
sample purity. 

Ouanritadve Analvsis of a Plasmid After 
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Samplo 



no. 



5 
6 
7 

9 
10 

Mean 



Table 1. Hoproduclbfillty of S*mplo Preparation Method 



100 ng 



standard 
mean deviation CV 



18,24 
18.23 
■ 1*.33 
18.33 
18.35 
18,44 
18.3 
18.3 
18.42 
18.15 
18.23 
18.32 
18.4 
18.38 
18.46 
18.54 
18.67 
19 

18.28 

18.36 

18^2 

18.45 

18.7 

18.73 

18.18 

18.34 

18.36 

18.42 

18.57 

18.66 

0 io) 



1«.27 
18.17 



18.39 



18..S5 
18.12 



0.06 
0.06 



18.34 0.07 



18.23 0.08 



18.42 OXM 



18.74 0.24 



0.12 



18.63 0.16 



18.29 0.1 



0.12 
0.17 



0.32 
0,3? 
0.36 
0.46 

0,23 

1.26 

0.66 

0.83 

0,55 

0.65 
0,90 



20.48 

20.S5 

20,5 

20.61 

20.59 

20.41 

20.54 

20.6 

20.49 

20,48 

20,44 

20.38 

20.68 

20.87 

20.63 

21.09 

21,04 

21 .04 

20.67 

20,73 

20.65 

20.98 

20.84 

20.75 

20.46 

20,54 

20.48 

20.79 

20.78 

20.62 



25 ng 



standard 
mean deviation 



20.51 



70.54 



20.54 



20.4 3 



20.68 



20.51 

20.73 
20.66 



0.03 
0,11 
0.06 
0.05 



20.73 0.13 



21.06 0.03 



0.04 



20.86 0.12 



0,07 

0.1 
0.19 



cv 

0.17 

0.54 

0.28 

0.26 

0.61 

0.15 

0.2 

0.57 

0.32 

0.46 
0,94 



tor containing a partial cDNA /or human factor 
viii, pi : 8TM. A scries of transections wan sot 
up using a decreasing amount of the plasuiid v (40, 
4, 0.5, and O.I ng). 1Vvt-my-fV*ur hours poM- 
tranaftri ion, total frhJA w<ts purified from each 
flask ufirlh. p-Aeliii gene quantity wai ehuMrn *in 
it value for normali^i itm of Ke.fiomic ONA con- 
centration from each sample. In this cxpeuiuvut, 
p-actin rciic content should remain constant 
relative to rotal genomic DNA. Figure 3 shown the 
result of the p-actin UNA measurement (100 jig 
total DNA determined ny ultraviolet spectros- 
copy) of each suiiiple. Kach Sample was analyzed 
in triplicate and the mean p-actin Cr values of 
the .triplicates were plotted (error bars represent 
/•.^a rt^iiiflfrt H^Mat,nni 1 h#» ttioh#*ST niffrrrncr 



between anyiwfi simplo moans was 0.9 S C n Ten 
nanograms of totyl DNA of e&eh Sample were aUo 
examined for iVactln. Ilic results n$am showed 
that very similar amounts of genomic DNA were 
present; the. maxim urn mean |i act in value 
difference wa.s 1 ,0. As Figure 3 shows, the rate of 
p-aetlu C r clniMKe between the J00 acid 10-ng 
sajnplc-i was similar (slope values rang« between 
3.56 and - 3.45). This verifies again th«Hhe 
mcthod of sample preparation yields vamptas of 
identical PCR integrity (i.e., no sample contained 
an excessive ainount cif a PCR inhibitor). TTow* 
«vcr, these results; indicate that cadi sample con 
talncd slight differences in the actual amount of 
genumic DNA analysed. Determination of actual 
uuuomic ONA lonccnUaiion was accomplished 
90RH 2026 091 6*6 Tfd 00:SI ZO0Z/S0/ZT 
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21 

so.s 
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16 




1JI 1.4 



t.e kg 1.7 i.fl i.e 
log (ng Input genomic DMA) 



Figure 2 S<« i iple preparation purity. 1 he replicate 
samples shown In Table 7 wore also amplified In 
tripicate using 25 09 of each DNA (ample. The fig- 
uifc shows die input DNA concentration (TOO and 
25 ng) vs. C, In ihr* lirjurp. ih*> 100 and 75 ng 
points for each sample are connected by a line. 



by plotting the mean £*actio O, value obtained 
for «ai:h 100- ng saxnplv wn a p-aciln standard 
\.anve (shown in I'iH- actual genomic 

ONA concentration <>( each sum pit:, f/, was ob 
tallied try extrapolation tt> tllu *-axU. 

Figure. 4 A shows the measured <l.u. f iu»V 
normalised) quantities uf /actor VJIJ pi n hi n id 
ONA fpl^TM) from each of Hu: four transient cell 
lrai^fecUona. Each react Ion contained 100 ng of 
total sample? DNA (as determined by UV spectro* 
copy). 1\&ch sample w;is analyzed in triplicate 



1>C :i< simplification*. Ak shown, pFBTM purified 
,fioic Jhc 20'A cells decreases (mean C, values in- 
cntiirf^j with decreasing amounts of plasmld 
AruittH'Ued, The m«n C t values obtained for 
pFBTM inTlgufc 4A wore plotted cm a standard 
curve comprised u( sciiiilly diluted pF8TM, 
shown .m figure 4R. The quantity uJ pFKTM, 
found in each of the four transections was de 
tcrmintrd by extrapolation to the * axlt; of the 
standard curve In Figure 41*. 'Il>wc uncorrected 
M values, b, for pHfJ'M were normaN^d to del er- 

mine the a dual amount of pl'81M found per 100 
ng of genomic ONA by using the equation:. 

h X 10 0 tig actual pI-rTTM copies ner 
^~ r 100 ng of genomic DNA 

where a --actual genomic DNA in u sample and 
w pl ; H'l*M copies from the standard curve. The 
normalised quantity of pF&TM per 100 ng of ge- 
nomic ONA for each of The four irana'facWon.s b 
jthown in Figure 4J J. 'Hi cm; rouilJi Allow Uiai the 
quantity of factor Vlll plasiulU associated wiih 
the 2v3 eellN, 24 hr after iransfwli<«i, decioise.v 
with decreasing pltfMuid cuui.«iuiaUon used In 
the transection. The quantity of pFBTM associ- 
ated Willi 293 cells, after transfeaion with 40 jxg 
of piasmid, was 35 pg per 100 ng genomic PNA. 
Tills results In -520 plasudd copies per cell. 



■1 



25 



23 



21 



20 



27.79 * <n«Vr»(. 1 



40 M9 

— • 0 J |to 
• a 0.1 pg 



CM* 



1 



M 1.8 
leg (ng Input ONA) 

Figure 3 Analyil* uf tmnsfectcd cdl DNA quantity 
and purtty- I he DNA preparations* of Ihu four 293 
cell transections (40, A, 0.5, and 0.1 fxcj of pF8TM) 
were analysed for the 3-actln gene* 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transferred, the p-actln 
C T values are plotted versus the total Input DNA 

**r* t r -a ll A n 



DISCUSSION 

We have described a new method for quantis- 
ing gene copy numbers using real-time analysis 
of PCJR axnpJlficatlnnx. Real-time PCK is compai- 
Ible with dthw of the two KIR (KT-PCR) ap- 
proaclio: (1) quantitative coinfxtiitivc where an 
inteinal wn»|/cihor for each target sequence is* 
used for normaliKation (data not shown) or (2) 
quantitative comparative PCK using a nuuuttli^a- 
ttou gene contained within the sample (i.e., (3-nc- 
tin) ox a "housekeeping" gene for RT-PCK. If 
equal amounts of nucleic ue)d are analyzed for 
each sample and if the amplification cffkir.m.y 
before quantitative analysis o identical for cadi 
sample, the tnrernul cuntuil (iiurmaH^linu gene 
or competitor) should give equal Mgnals for fll) 
samples. 

The real-time PCtt method offers scvcnil ad- 
vantages over the other two methods currently 
employed (see the tntroduction). First, the real- 
time PGR method is performed in a doscd-tube 
system and requires no post-PCR manipulation 
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F] D iire 4 Quantitative flnolysi/c of pFSTM in transacted cclb. (.4) Amount of 
plasmid DMA used for (he transfectlon plotted against the mhhim C, value deter- 
mined for pffiTM rcmalniny fa b r aitcr transection. (0,C) Standard curvr* of 
pPATM *nd P-acdn, respectively. pf8TM ONA (0) and genomic C>NA (Q were 
dilutftd fiArially 1 ;S before amplification with the appropriate primers. The p-actin 
standard curve wa* usod to normalise the results of A to 1 00 txtj of genomic DNA- 
(0) The amount of pF8TM preseni per 1 00 ng of genomic DNA. 



of sample. Therefore, I In* potent**) for PGR con- 
tamination in the laboratory is reduced because 
amplified products can In* aiialywd and disponed 
of without opening tho ru ^ct ion tubes. Second, 
this method suppoil* the umi ofa tiorriiiilixtition 
gene (Lc, P-actin) for quantitative PCR or house- 
keeping genes for quantitative RT-l'Clt controls. 
Analysis is performed in real time during the log 
phase of product accumulation. Analysis during 
Iuk phase permits many .different genes (over a 
wide input target range) to be analyzed simulta- 
neously, without concern of reaching miction 
plateau at different Cycles, Tius will make imilll- 
gens analysis assays much caMv.i u> develop, be- 
cause individual internal competitor will not bc 
necded for ench gene under analysis Third, 
sample throughput will iittieasc dianialicaUy 
with the ticw method because there is no post- 
PCR processing time. Additionally, walking In a 
96-wcll format h highly compatible with auto- 
mation technology, 

The real-time PCR method is highly repro- 
ducible. RepHcatc amplifications can be analysed 



for c^ach sample minimizing ]>otcntittl error. The. 
sysiitm allow* for ;> very large assay dynamic 
range (approaching 1, 000,000 -fold starting tai- 
gel). Uwlug a standard curve for the. target oi in- 
terest, relative copy number values can be deter- 
mined for any unknown sample. Fluorescent 
threshold values, C, v c^id air. linearly with rela- 
tive DNA copy numbers. Real time quantitative 
KT*I'CR methodology (Gibson et al v this iwuft) 
has also bcein dttvetlopcd. Finally, real time quan- 
titative I'CU methodology can be used to develop 
high-throughput screening assays for a variety of 
applications [quantitative gene c*pi«H>iun (KT- 
PCR), gene copy a^ays (fieri, HIV, etc.), geno- 
typing (knockout mouse, analysis), and Imniuno- 
POllj. 

Real-time POU may also l>e performed using 
intercalating dyes (Higuchi ct ul. WXA) such as 
ciJviditim bromide. The fluorogenic probe 
method offers a major advantage over inter- 
calating dyes- -greater specificity (i.e., primer 
dl meres and nonspecific PCR products are not de- 
tuned). 
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METHODS 

Generation uf <t PUsmld Containing a Partial 
cDNA for Human Factor VIII 

Total UNA vy<t» hiirvcMexl (UN A*ol U front T«l To*i, Inc., 
r-ncndswood, TX) from cvll> imnsfecled with a factor VHI 
rxpreasluu vector, pClS^lk^U (Kaum vt hi. 1VH6; Gor. 
man c.l al. 1990), A factor VIII partial cl )N A sorpicntv WAS 
^in«rnud by IXV \>C:\t KJoitcAmp |<Z iTth UNA Uf.U 
(pan NtwiR-ov/*), rt Applet) biosystems, l-'ostei ^<ity, c;A)J 

uslu£ Ok* l'C:u primers FKfor *»tul Pftrcv (primer Krqueiirr* 

arc shown below). The ampHcon w;is rcampbfied udnft 
modified I'tWor and FVrcv printers («]1|m*iuUk1 with rtwwlil 
and /f/irdlll restriction site sequences at the V cm If and 
clonal i"t<> jXiKM- 3Z (rromofcu Corp,, Minlmon, WI). The 
resulting clonr, pVSTM, was used lor transient transfealon 
erf 293 cells. 



Amplification of Target DNA ami Dcleciion of 
Amplicpn Factor VIII Masmid DNA 

(pr'HTM) was wnpltflud with the piimfi* VHfur 5'-C(;C- 

<rrcK;(;AAUA(j:jxjA(;uiOTC.3' and parev 5*-AAA<;crr- 

tUOCCrroaATCi<j rACJCi-3'.Hic rvneiUm piodocvd « 
np K:k product. The forward primer wo* de>h;ued to too 
ognlxv a unique M-ipu'tui- Ann id Jn the 5' untranslated 
region of the paiem uClS2,eKZ5l> pldMiiid nml therefore 
does not h'uiknUa: and amplify the human factor VIII 
gene. 1'rimoffi woro chostm with the av»ytaiirr*'nf live* com- 
puter program Oligo *WJ (National biosciences, lnc„ Ply- 
mouth, MN). The human p-aetl« gene was Amplified with 
the piimeo li-t*ftir> forward primer 5* TCACXV.lACA< T( IT 
GCCCATCI'AGGA-.V and fci-actin reverse piirncr V.CA<;. 

CGGAACCXicrr<^'ri't;<:cJiATGG-3'. The reaction pro- 
cuicco a 295- hp i<;u product. 

Amplification reaction* (SO pj) comHined a DNA 
sample, I0x VCAK Kuffe.r H (& p.]), 200 jim UAH', dCTl\ 
dGfl\ and 400 tiM rill'IT, 4 mx< Mfl<;i ? , i.a.s Units Ampll 
7W<y r;NA poiymciasc, 0,5 unit AiiipKrnsv uracil fV-fjly- 
uuayluMV <UNC), 50 pinole of each footoi VIII (irlmei, urrd 15 
piuolt* of iiuoh u actio p< liner, 'Hut lOat-tUMK ahu inmlatucd 
OIW Of the following detectlmi proUv (WIO iim rprh): 

i^jifi>i><- A'fi'AtwOAfirrjrrjticuctyrcjcrntrnTCTcrr- 

GCCTT(TAMRA)p 3' fl ud p-aetin proU: 5 f (rAM)ATGf.X;c;- 
X(TAMl*A)CCCCCATCCCATCp-.T where p indicates 
phoAplinrylAfjon And X tndlcotcs a linker arm nuclcKitide. 
Reaction IuIk'» wrtx- Mi<:n)An\p Optii^nt Tutx-S (part AUm- 
IhtNKOI ttOXi, I»crkhi Ulniur) tiiai wore fruuiwl (at J»erWn 
F.lnicr) to |»rvvc-iil light from /cflccllng, Tube eops were 
Simitar in MicroAmp i;nj)s Ixil specially dcaigncd to pre- 
vent llj;l 1 1 svutkTMijj, All <il Ui<* Vi'M tiUMiNutnhhU'* were 5Vi>v 
plied 1>y PK Applied l^oAyM^ni? (I'^iMor 01 ty, CA) except 
the factor Via primers, whk'h wcie synthesized nt Cvnen 
lech, Inc. (South Sati Francisco, CA). Probes ww 4ic*sl(*nod 
tiding the Olj;;o 4.0 .software, follow! d^; gtitdellnev 

&C5te<i in tnc MculeJ 7700 .sequence Peuruu- liMtiuiiieiil 
manual, briefly, probv T m iJinuUl he a! Jcast 5 U C higher 
than the anneullnx leuipvjdlurc u.ted <1urlfi^; thfrmitl cy- 
rltng; primerti sho\iU1 nut fuim M«iblv duplexed with the 
probe. 

The therm*! **ycllng CunditKuvs InrlutlvU 2 mln at 
50 4 X": and 10 niin al 9S*C. 'Hica-mal cycling procrrded with 



rt-actionii were performed in Ihf Model 77011 Sequence De- 
lc*«*tor 0*t Applied Uluvy^lumv), w»ltirh cnnuUu ,* Cciv- 
Allip l*<:U Syttlum yOOO. Ut;a<:llon condition* w<*rf* pvo< 
^ruiiiiucU on .i VuvfQt Mflcinti^h V100 (Apple O.irnpiitrr, 
?»anta Clara, CA) linked ttirvcily to the Model VVOn to* 
c)ucnev IXtlffCtor. AiiMly^U of data wn* aKo |M«rf/irmed on 
the Mm lntosh computer, f *.ollncilon and analysis «>fiwaro 
wi« devckj|ied hi Applied l^losyn twins. 



Transfection of Cell* with Factor VIII Coimruci 

JViur I17.S flaski; of 293 cells (ATCC CULL 1573), ?> human 
fetol kidney »i»Npvn^ion cell line, were h«'wh to 80% ton- 
Ituency A0w1 Iransfcrted pl-WI*M. Cells wero grown in the 
following incdlfli S0% H/\M^ HI 2 wllhoui GMT, ,^0% h»v 
glucose Dulbeaw^ modified Ragle medium (UMIiM) with, 
otn glycine wilh sodium bicarbonate, 10% ietal Uivine 
serum, 2 oim L^lutdiniDt, and 1% pcnidllin-atrcptomy- 

Khtf The medio ww clunffed 30 mln hcf<.m* Hie* Iransfec 
lion, pJHJTM DMA ^rnountt* of AO, 4, OS, and 0,1 w w 
iidiled h> 1 V S ml of ft solution conialnlng 0.125 m CkiGl;- 
«nd 1 x Ht'^KI. The four mixtures wtrrc left at room tern- 

I.K'.rptun- (<tt TO mln and then «td<lv*l Hropwtftc- to the cells. 
TJiv wvieuK-ul^led al 37°C"«nd 5% C0 3 for 24 lu, 

wujihed with PUS, ,nieJ ro««!»pcndcd In PUS, The re*n« 
jn-rniv:d cclU were divided into »liqunt» und DNA wai c>*- 
trHcted luiinedlutely usins IhvQIAunip KI«»ik1 Kit (Qli(*en. 
aiet^m»rth, <.:A). UNA wus duted Into 200 of 20 
Trls-I ICJ ol pllH.0. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP- 1 and WISP-2, that are up- regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISPS , these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tet racy line repress ible promoter, and («) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q243. WISP'l genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to >30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- v 
pressed (4- to > 40- fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and Us DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the ceil membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-3/3) resulting in an increase in 
/3-catenin levels. Stabilized 0-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (A PC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). A PC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, gooseco'id, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISPS. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
. Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 jxg of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 fxg 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP- J were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
. coding full-length mouse and human WlSP-2 were isolated by 
screening a C57MG /Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA, PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /xM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeIed sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WlSP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified T and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2 (Act) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene Copy 
number or RNA expression level. The W75P-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B), Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on )3-catenin levels (13, 14). Expression of WISP-1 was 
up- regulated approximately 3-fold in the C57MG/ Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 24). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ^27,000 (M T 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-U and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 ^g) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- /-specific probe 
(amino acids 278-300) or a 190-bp WISP-2-spccific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human 0-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-I (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-L 

Identification of WISP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
" human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and' 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISPS are novel sequences; 
however, mouse WISP- 1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-l (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-L All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fic. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3,. whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerizatiori and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. {A , C, £ , and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP- J is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and 0), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of.WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM21 lze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PGR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDrceli lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 fig) 
digested with £coRI (WISP-1) ox Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 



Cell Biology, Medical Sciences: Pennica et al. 



i . 



lifiiilaaiailliiifliilJiia 



WISP-1 



WISP-2 




WISP-3 



liiiiiiiiiiiiiiiii.i.iii 



Tumor Number 

Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means i SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-foId) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 
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mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov t a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of. 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through )3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov y WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may. bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v j33 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory ceils at the tumor 
interstitial interface secrete TGF-01, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply. WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysts of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP ^2. was localized to chromosome 20ql2-20q 13, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISP s as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPMI growth medium. T-ceil -proliferation assays were 
done essentially as described- 0,21 . Briefly, after antigen pulsing (30p.gmr' 
TTCF} with tetrapeptides (l-2mgmr'), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glutaraJdehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T- cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 u,Ci of 3 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 jxg TTCF with 0.25 p.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for I h at 37 °C. 
Glycopeptide digestions. The peptides HIDNEEDI, HJDN(N- glucosamine) 
EEDI and HIDNESDI, which are based on the TTCF sequence, and 
QQQHLFGSN^DCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic ( Promega) digestion 
of 5mg reduced, carboxy-methylated human transferrin followed by 
concanavalin A chromatography". Olycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N- terminal sequencing. The iyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mUral"' pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgrnl" 1 a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. I). 
One important role of FasL and Fas is to mediate immune- . 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine -rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6 ' 7 , Apo3L/TWEAK 8,9 , or OPGL/TRANCE/ 
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RANKL 10 " 12 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL- transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
Ll±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble- FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 pLgml" 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation - 
induced cell death (AICD) of' mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results' 3 , activation 
of interleukin-2 -stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes M4 "'*'. Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half- maximal inhibition at — ljigml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Pas- Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune -cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune -cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the A/-linked glycosylation site (asterisk) are 
shown, b, Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL, peripheral blood 
lymphocyte. 
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Figure 2 Interaction of OcR3 with FasL. a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (sojid line, shaded area). TNFRl-Fc (doned line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference (P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE, phycoerythr in- 
labelled cells, b. 293 cells were transfected as in a and metabolicaliy labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1, DcR3 or Fas. 
. c. Purified soluble FasL (sFasL) was immunoprecipitated with TNFR1-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Rag-tagged sFasL was 
incubated with OcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR)" in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18- fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3-based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 
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Figure 3 Inhibition of FasL activity by DcR3. a t Human Jurkat T leukaemia ceils 
were incubated with Rag-tagged soluble FasL (sFasL:.5ngml~ 1 ) oligomerized 
with anti-Flag antibody (0.1 u.gmryin the presence of the proposed inhibitors 
DcR3-Fc. Fas-Fc or human IgGl arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b, Jurkat cells were incubated with sFasL-Flag plus anti-Flag antibody 
as in a, in presence of 1 n-g ml* 1 DcR3-Fc (filled circles), Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interleukin-2. 
followed by control (white bars) or anti-CD3 antibody {filled bars), together with 
phosphate-buffered saline (PBS), human IgGl, Fas-Fc, or DcR3-Fc (lO^gmr 1 ). 
After 16 h, apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d. Peripheral blood natural killer ceils were incubated with 51 Cr- 
tabelled Jurkat cells in the presence of OcR3-Fc (rilled circles), Fas-Fc (open 
circles) or human IgGl (triangles), and target-cell death was determined by 
release of 51 Cr (mean ± s.d. for two donors, each in triplicate).- 
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we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM2 18xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAG) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clones insert We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3-linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre* of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand-family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2,19 . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 



a b d 




Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas (c, d. f. g. h, j. k, r), seven squamous-cell carcinomas (a, e, 
m, n. o, p. q), one non-small-cell carcinoma (b), one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means z s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c, in situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd), the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing OcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's f-test 
comparing each marker with DcR3. 



NATURE| VOL 396| 17 DECEMBER 1998 1 www.naturc.com 



Nature © Macmillan Publishers Ltd 1998 



701 



tetters to nature 



FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-l has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L". Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG 3 , which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. Q 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 153357 1, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL' (2 fxg), together with pRK5 encoding CrmA 
(2u,g) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fcor TNFRI-Fc and then with phycoerythrin-conjugated 
streptavidiri (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected ceils by DcR3-Fc; as these cells express little FasL (data 
not shown),, it is possible that DcR3 recognized some other factor that is 
expressed constitutiyely on 293 cells, 

Immu no precipitation. Human 293 cells were transfected as above, and 
metabolically labelled with ( 35 S]cysteine and [ 35 S] methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (l0u,M), 
the medium was immunopreci pita ted with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5p.g), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL ( 1 jig) (Alexis) was incubated 
with each Fc-fusion protein (1 u,g), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 ng) was 
incubated with buffer or with DcR3-Fc (40 u,g) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 \x\ aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc. 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgG 1 before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u.g ml"' ) for 24 h, and cultured 
in the presence of interleukin-2 ( 100 U ml"') for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later by FACS analysis of annexin-V-binding of CD4 + cells". 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 5l Cr-loaded Jurkat cells at an effector- 
to- target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of ^'Cr in effector- target co- 
cultures relative to release of s, Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluororaetry. Amplification was determined by quantitative PCR 1 * 
using a TaqMan instrument ( ABI ). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAG 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to —500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' - ( FAM - ACACG ATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 (iCT \ where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous & colt 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated With antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurtum. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane -bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of 5. typhimurium and E. co/i ,J ~ 8 is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded fi- 
shed (p3 and p8-p 12) spans both arms of the L, with a domain of a 
a- plus P- type structure ((31, 02, P4-P7, al and al) on one side 
(within arm I) and a domain of mostly a-helices (a3^a9) on the 
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Figure 1 - Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The p-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a, towards arm II, showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT* 9 . N, amino terminus: C, C 
terminus. 
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I 



. Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-cha in -reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erfaB2) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661 -666, 1 998. 
© 1998 Wiley-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minute's 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et al, 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( 1 1 q 1 3), and erbB2 ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns et al. 1992; 
Schuuring et al, 1992; 91amon et al, 1987). Muss et al (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al. (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard cup*& is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
. novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson etai, 1996; Heid et 
al, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et al. (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et aL, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et aL, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C t 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 rn\ 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and erbBl), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter Q (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring Q and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e.; lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et aL, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N'\ and is determined as follows: 

copy number of target gene (app, myc, ccndl, erbBl) 

N = ' . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs OHgo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, Micro Amp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et aL (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10~ 7 (10 5 copies of each gene) to 
10" 10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCI 2 , 1 .25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates Q and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 



RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbB2 proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (2 1 q21 .2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as I0 2 copies or as 
many as 1 0 5 copies. 

Copy-number ratio of the 2 reference genes (app and alty 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10 5 (A9), 10 4 (A7), 10 3 (A4) to 10 2 (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. C, (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom; 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions {app, 21q21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et ah, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary ! breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast- tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and erbi?2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 1 8 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc; 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1.3 (mean 0.91 ±0.19) forer£>B2. Since N values 
for myc, ccndl and erbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1 .6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbB2 gene dose in breast-tumor DNA 

myc, ccndl and erbB2 gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for erbBl, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (TI 18). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbBl and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbBl in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 1 7q21 region (the site of erbBl). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 108 primary breast tumors. A 
perfect correlation between the results of real -time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N ^ 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND crbhl GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 2-4.9 




myc 


0 


97(89.8%) 11 (10.2%) 


0 


ccndl 


0 


83(76.9%) 17(15.7%) 


8(7.4%) 


erbBl . 


5 (4.6%) 


87 (80.6%) 8 (7.4%) 


8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a, PCR method developed for the 
quantification of gene over-representation in tumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the AB I Prism 7700 Sequence Detection System enables C, to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C t value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C, ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disa vantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996; Slamon et ai, 1989). However^ FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature. (0 Chromosome regions 4qll-ql3 and 21q21,2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (//) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Berns et ai, 1 992; Borg et ai, 1992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- . 
tion, confirming the findings of Borg et ai (1992) and Courjal.e/ al 
(1997). (i'v) The maxima of ccndl and erbBl over-representation 
were 1 8-fold and 1 5 -fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 18 (E12, C6, black squares), Tl 33 (G 1 1 , B4, red squares) 
and T145 (A8, C8, blue squares). Given the Q of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experimenL Triplicate plots were performed for each lumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Berns et ai, 1992; Borg et al, 1992; Courjal et 
al. 1997). (v) The erbB2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et al, 1 995; Deng et al, 1 996; Valeron 



et al, 1996). Our results also correlate well with those recently 
published by Gelmini et al ( 1 997), who used the TaqMan system to 
measure erbB2 amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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TABLE 11 - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS 1 
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'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Hccndl/alb) is determined by dividing the average ccndl 
copy namber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erb\$2) observed by means of real-time 
quantitative PCR as compared with Southern-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysemy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et ai, 1992; 
Slamon e/ a/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbB2 (but not of the other 2 proto- 
oncogenes) in several tumors; erbBl is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- . 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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PEG aldehyde via reductive alkilation. Bioconj. Cherru 5, 133-140 (1994). 

31. Jin, R, Yang, R., Marsters, S., Bunting, S., Wurm, F., Chamow, S., and 
Ashkenazi, A. Protection against rat endotoxic shock by p55 tumor necrosis factor 
(TNF) receptor immunoadhesin: comparison to anti-TNF monoclonal antibody. J. 
Infect Diseases 170, 1323-1326 (1994). 

32. Beck, J., Marsters, S., Harris, R., Ashkenazi, A., and Chamow, S. Generation of 
soluble interleukin-1 receptor from an immunoadhesin by specific cleavage. Mol 
Immunol 31, 1335-1344 (1994). 

33. Pitti, B., Marsters, M., Haak-Frendscho, M., Osaka* G., Mordenti, J., Chamow, S., 
and Ashkenazi, A. Molecular and biological properties of an interleukin- 1 
receptor immunoadhesin. Mol Immunol. 31, 1345-1351 (1994). 

34. Oksenberg, D., Havlik, S., Peroutka, S., and Ashkenazi, A. The third intracellular 
loop of the 5-HT2 receptor specifies effector coupling. J. Neurochenu 64, 1440- 
1447.(1995). 

35. Bach, E., Szabo, S., Dighe, A., Ashkenazi, A., Aguet,JM., Murphy, K., and 
Schreiber, R. Ligand-induced autoregulation of IFN^-y receptor P chain expression 
in T helper cell subsets. Science 270, 1215-1218 (1995). 

36. Jin, H., Yang, R., Marsters, S., Ashkenazi, A., Bunting, S., Marra, M., Scott, R., 
and Baker, J. Protection against endotoxic shock by bactericidal/permeability- 
increasing protein in rats. J. Clin. Invest 95, 1947-1952 (1995). 

37. Marsters, S., Penica, D., Bach, E., Schreiber, R., and Ashkenazi, A. Interferon y 
signals via a high-affinity multisubunit receptor complex that contains two types 
of polypeptide chain. Proc. Natl . Acad. ScL USA. 92, 540U5405 (1995). 

38. Van Zee, K., Moldawer, L., Oldenburg, H., Thompson, W., Stackpole, S., 
Montegut, W., Rogy, M., Meschter, C, Gallati, H., Schiller, C, Richter, W., 
Loetcher, KL Ashkenazu A ., Chamow, S., Wurm, F., Calvano, S., Lowry, S., and 
Lesslauer, W. Protection against lethal E. coli bacteremia in baboons by 
pretreatment with a 55-kDa TNF receptor-Ig fusion protein, Ro45-208 1 . J: 
Immunol 156,2221-2230 (1996). 

39. Pitti, R., Marsters, S., Ruppert, S., Donahue, C, Moore, A., and Ashkenazi, A . 
Induction of apoptosis by Apo-2 Ligand, a new member of the tumor necrosis 
factor cytokine family. /. Biol Chem. 271, 12687-12690 (1996). 
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40. Marsters, S., Pitti, R., Donahue, C, Rupert, S., Bauer, K., and Ashkenazi, A . 
Activation of apoptosis by Apo-2 ligand is independent of FADD but blocked by 
CimA. Curr. Biol 6, 1669-1676 (1996). 

41. Marsters, S., Skubatch, M., Gray, C, and Ashkenazi A . Herpesvirus entry 
mediator, a novel member of the tumor necrosis factor receptor family, activates 
the NF-kB and AP-1 transcription factors. J. Biol Chem. 272, 14029-14032 
(1997). 

42. Sheridan, J., Marsters, S., Pitti, R., Gumey, A., Skubatch, M., Baldwin, D., 
Ramakrishnan, L., Gray, C, Baker, K., Wood, W.L, Goddard, A., Godowski, P., and 
Ashkenazi A. Control of TRAIL-induced apoptosis by a family of signaling and 
decoy receptors. Science 277, 818-821 (1997). 

43. Marsters, S., Sheridan, J., Pitti, R., Gumey, A., Skubatch, M., Balswin, D., Huang, A., 
Yuan, J., Goddard, A., Godowski, P., and Ashkenazi, A. A novel receptor for 
Apo2L/TRAIL contains a truncated death domain. Curr. Biol 7, 1003-1006 (1997). 

44. Marsters, A., Sheridan, L, Pitti, R., Brush, J., Goddard, A., and Ashkenazi, A. 
Identification of a ligand for the death-domain-containing receptor Apo3. Curr. Biol 
8, 525-528 (1998). 

45. Rieger, J., Naumann, U., Glaser, T., Ashkenazi. A ., and Weller, M. Apo2 ligand: 
a novel weapon against malignant glioma? FEBS Lett 427, 124-128 (1998). 

46. Pender, S., Fell, J., Chamow, S., Ashkenazi, A ., and MacDonald, T. A p55 TNF 
receptor immunoadhesin prevents T cell mediated intestinal injury by inhibiting 
matrix metalloproteinase production. J. Immunol 160, 4098-4103 (1998). 

47. Pitti, R., Marsters, S., Lawrence, D., Roy, ICischkel, F., M., Dowd, P., Huang, A., 
Donahue, C, Sherwood, S., Baldwin, D., Godowski, P., Wood, W., Gumey, A., 
Hillan, K., Cohen, R., Goddard, A., Botstein, D., and Ashkenazi, A. Genomic 
amplification of a decoy receptor for Fas ligand in lung and colon cancer. Nature 
396,699-703(1998). ; 

48. Mori, S., Marakami-Mori, K., Nakamura, S., Ashkenazi, A ., and Bonavida, B. 
Sensitization of AIDS Kaposi's sarcoma cells to Apo-2 ligand-induced apoptosis 
by actinomycinD. 1 Immunol 162, 5616-5623 (1999). 

49. Gurney, A. Marsters, S., Huang, A., Pitti, R., Mark, M., Baldwin, D., Gray, A., 
Dowd, P., Brush, J., Heldens, S., Schow, P., Goddard, A., Wood, W„ Baker, K., 
Godowski, P., and Ashkenazi A. Identification of a new member of the tumor 
necrosis factor family and its receptor, a human ortholog of mouse GITR. Curr. 
Biol 9, 215-218 (1999). 
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50. Ashkenazi, A ., Pai, R., Fong, s., Leung, S., Lawrence, D., Marsters, S., Blackie, 
C, Chang, L., McMurtrey, A., Hebert, A., DeForge, L., Khoumenis, L, Lewis, D., 
Harris, L., Bussiere, J., Koeppen, EL, Shahrokh, Z., and Schwall, R. Safety and 
anti-tumor activity of recombinant soluble Apo2 ligand J. Clin. Invest. 104, 155- 
162(1999). 

51. Chuntharapai, A., Gibbs, V., Lu, J., Ow, A., Marsters, S., Ashkenazi, A., De Vos, 
A., Kim, K J. Determination of residues involved in ligand binding and signal 
transmission in the human IFN-a receptor 2. J. Immunol 1 63, 766-773 (1 999). 

52. Johnsen, A.-C, Haux, J., Steinkjer, B., Nonstad, U., Egeberg, K., Sundan, A., 
Ashkenazi, A., and Espevik, T. Regulation of Apq21/TRAIL expression in NK 
cells - involvement in NK cell-mediated cytotoxicity. Cytokine 11, 664-672 
(1999). 

53. . Roth, W., Isenmann, S., Naumann, U., Kugler, S., Bahr, M., Dichgans, 

. Ashkenazi, A., and Weller, M. Eradication of intracranial human malignant 
glioma xenografts by Apo2L/TRAIL. Biochem. Biophys. Res, Commun. 265, 479- 
483 (1999). 

54. Hymowitz, S.G., Christinger, H.W., Fuh, G., Ultsch, M., O'Connell, M., Kelley, 
R.F., Ashkenazi, A. and de Vos, A.M. Triggering Cell Death: The Crystal 

. Structure of Apo2L/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4, 
563-571 (1999). 

55. Hymowitz, S.G., O'Connel, M.P., Utsch, M.H., Hurst, A., Totpal, K., Ashkenazi, 
A., de Vos, A.M., Kelley, R.F. A unique zinc-binding site revealed by a high- 
resolution X-ray structure of homotrimeric Apo2I7TRAIL. Biochemistry 39, 633- 
640 (2000). 

56. Zhou, Q., Fukushima, P., DeGraff, W., Mitchell, J.B., Stetler-Stevenson, M., 
AshkenazirA., and Steeg, P.S. Radiation and the Apo2I/TRAIL apoptotic 
pathway preferentially inhibit the colonization of premalignant human breast 
cancer cells overexpressing cyclinD l. Cancer Res. 60, 2611-2615 (2000). 

57. Kischkel, F.C., Lawrence, D. A., Chuntharapai, A., Schow, P., Kim, J., and 
Ashkenazi, A. Apo21/TRAIL-dependent recruitment of endogenous FADD and 
Caspase-8 to death receptors 4 and 5. Immunity 12, 611-620 (2000). 

58. Yan, M., Marsters, S.A., Grewal, LS., Wang, H., * Ashkenazi, A., and *Dixit, 
V.M. Identification of a receptor for BlyS demonstrates a crucial role in humoral 
immunity. Nature Immunol. 1,37-41 (2000). 



59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and Ashkenazl A. 
Interaction of the TNF homologues BLyS and APRIL with the TNF receptor 
homologies BCMA and TACI. Curr. Biol 10, 785-788 (2000). 

60. Kischkel, F.C., and Ashkenazi, A . Combining enhanced metabolic labeling with 
immunoblotting to detect interactions of endogenous cellular proteins. 
Biotechniques 29, 506-512 (2000). 

61. Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, K., Shih, D. Mounho, B., 
Hillan, K., Totpal, K. DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R., 
Leung, S., Khan, L., Gliniak, B., Bussiere, J., Smith, C, Strom, S., Kelley^ S., 
Fox, J., Thomas, D., and Ashkenazi, A. Differential hepatocyte toxicity of 
recombinant Apo2L/TRAIL versions. Nature Med. 7, 383-385 (2001). 

62. : Oiuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Martsters, S.A., 

Koeppen, Ashkenazi A ., and Kim, K.J. Isotype-dependent inhibition of 
tumor growth in vivo by monoclonal antibodies to death receptor 4. J. Immunol. 
166,4891-4898(2001). 

63. Pollack, T.F., Erff, M., and AshkenazLA. Direct stimulation of apoptotic 
signaling by soluble Apo2L/tumor necrosis factor-related apoptosis-inducing 
ligand leads to selective killing of glioma cells. Clin. Cancer Res. 7, 1362-1369 
(2001). 

64. Wang, H., Marsters, S. A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yan, 
M., Dixit, V.M., * Ashkenazi A ., and *Grewal, LS. TACI-Ugand interactions are 
required for T cell activation and collagen-induced arthritis in mice. Nature 
Immunol 2, 632-637 (2001). 

65. Kischkel, F.C., Lawrence, D. A., Tinel, A., Virmani, A., Schow, P., Gazdar, A., 
Blenis, J., Amott, D., and Ashkenazi, A . Death receptor recruitment of 
endogenous caspase-10 and apoptosis initiation in the absence of caspase-8. J. 
Biol Ckem. 276, 46639-46646(2001). 

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E., Totpal, K., Morlan, J., Schow, P., 
Fong, S., Schwall, R., Sinicropi, D., and Ashkenazi, A T umor cell resistance to 
death receptor induced apoptosis through mutational inactivation of the 
proapoptotitc Bcl-2 homolog Bax. Nature Med. 8, 274-281 (2002). 

67. Miller, K., Meng, G., Liu, J., Hurst, A., Hsei, V., Wong, W-L., Ekert, JL, 
Lawrence, D., Sherwood, S., DeForge, L., Gaudreault, Keller, G., SliwkowskL, 
M., Ashkenazi. A ., and Presta, L. Design, Construction, and analyses of 
multivalent antibodies. J. Immunol 170, 4854-4861 (2003). 
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68. Varfolomeev, E., Kischkel, F., Martin, R, Wanh, H., Lawrence, D., Olsson, C, 
Tom, L., Erickson, S., French, D., Schow, P., Grewal, I. and Ashkenazi, A. 
Immune system development in APRIL knockout mice. Submitted. 

Review articles; 

1 . Ashkeiiazh A., Peralta, E., Wirislow, J., Ramachandran, J., and Capon, D., J. 
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold 
Spring Harbor Symposium on Quantitative Biology. LIH, .263-272 (1 988). 

2. AshkenazL A ., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
Functional diversity of muscarinic receptor subtypes in cellular signal 
transduction and growth. Trends Pharmacol Scu Dec Supplement, 12-21 (1989). 

3. Chamow, S., Duliege, A., Ammann, A., Kahn, J., Allen, D., Eichberg, J., Byrn, 
R., Capon, D., Ward, R., and AshkenazL A . CD4 immunoadhesins in anti-HIV 
therapy: new developments. Int. J. Cancer Supplement 7, 69-72 (1992). 

4. ' AshkenazLA., Capon, and D. Ward, R. Immunoadhesins. Int. Rev. Immunol. 10, 

217-225(1993). 

5. Ashkenazu A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27, (1994). 

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and Ashkenazi, A . 
Inhibition of erythrbid colony formation in vitro by gamma interferon. In 
Molecular Biology of Hematopoiesis (NL Abraham, R. Shadduck, A. Levine F. 
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994). 

7. Ashkenazi. A . Cytokine neutralization as a potential therapeutic approach for 
SIRS and shock. Biotechnology in Healthcare 1, 197-206 (1994). 

8. Ashkenazi, A .; and Chamow, S. M. Immunoadhesins: an alternative to human 
monoclonal antibodies. Immunomethods: A companion to Methods in 
Enzimology 8, 104-115 (1995). 

9. Chamow, S., and Ashkenazi, A . Immunoadhesins: Principles and Applications. 
Trends Biotech. 14, 52-60 (1996). 

10; Ashkeriazi; A ., and Chamow, S. M. Immunoadhesins as research tools and 
therapeutic agents. Curr. Opin. Immunol. 9 ; 195-200 (1997). 

11. AshkenazLA., and Dixit, V. Death receptors: signaling and modulation. Science 
281,1305-1308(1998). 

12. Ashkenazi, A ., and Dixit, V. Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell Biol. 11, 255-260 (1999). 



13. Ashkenazi. A . Chapters on Apo21/TRAIL; DR4, DR5, DcRl , DcR2; and DcR3. 

Online Cytokine Handbook (www.apnet.com/cvtokinereference/) . 
14; Ashkenazi, A. Targeting death and decoy receptors of the tumor necrosis factor 
. superfamily. Nature Rev. Cancer 2, 420-430 (2002). 

15. LeBlanc, H. and Ashkenazi. A . Apoptosis signaling by App2L/TRAIL. Cell Death 
and Differentiation 10, 66-75 (2003). 

16. Almasan, A. and Ashkenazi. A . Apo2L/TRAIL: apoptosis signaling, biology, and 
potential for cancer therapy. Cytokine arid Growth Factor Reviews 14, 337-348 
(2003). ■ ' ; 

Book: 

Antibody Fusion Proteins (Chamow, S., and Ashkenazi. A ., eds., John Wiley and 
Sons Inc.) (1999). ! " 

Talks: 

1. Resistance of primary HIV isolates to CD4 is independent of CD4-gp 120 binding 
affinity. UCSD Symposium, HIV Disease: Pathogenesis and Therapy. 
Greenelefe, FL, March 1991. 

2. Use of immuno-hybrids to extend the half-life of receptors. DBC conference on 
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992. 

3 . Results with TNF receptor Immunoadhesins for the Treatment of Sepsis. BBC 
conference on Endotoxemia arid Sepsis. Philadelphia, PA, June 1992. 

4. Immunoadhesins: an alternative to human antibodies. IBC conference on 
Antibody Engineering. San Diego, C A, December 1993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock. 
American Society for Microbiology Meeting, Atlanta, GA, May 1993. 

6. Protective efficiacy of TNF receptor immunoadhesin vs anti-TNF monoclonal 
antibody in a rat model for endotoxic shock. 5th International Congress on TNF. 
Asilomar, CA, May 1994. 

7. Merferonry signals via a multisubunit receptor complex that contains two types of 
polypeptide chain. American Association of Immunologists Conference. San 
Franciso, CA, July 1995. 

8. Immunoadhesins: Principles and Apphcations. Gordon Research Conference on 
Drug Delivery in Biology and Medicine. Ventura, CA, February 1996. 
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9. Apo-2 Ligand, a new member of the TNF family that induces apoptosis in tumor 
cells. Cambridge Symposium on TNF and Related Cytokines in Treatment of 
Cancer. Hilton-Head, NC, March 1996. 

10. Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and 
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New 
Orleans, LA, June, 1996. 

1L Apo2 ligand, an extracellular trigger of apoptosis. 2nd Clontech Symposium, 
Palo Alto, CA, October 1996. 

12. Regulation of apoptosis by members of the TNF ligand and receptor families. 
Stanford University School of Medicine, Palo Alto, CA, December 1996. 

13. Apo-3: anovel receptor that regulates cell death and inflammation. 4th 
International Congress on Immune Consequences of Trauma, Shock, and Sepsis. 
Munich, Germany, March 1997. 

14. New members of the TNF ligand and receptor families that regulate apoptosis, 
inflammation, and immunity. UCLA School of Medicine, LA, C A, March 1997. 

15. Immunoadhesins: an alternative to monoclonal antibodies. 5th World Conference 
on Bispecific Antibodies. Volendam, Holland, Jime 1997. 

1 6. Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on 
Programmed Cell Death. Cold Spring Harbor, New York. September, 1997. 

17. Chairman and speaker, Apoptosis Signaling session. IBC's 4th Annual 
Conference on Apoptosis. San Diego, CA., October 1997. 

1 8. Control of Apo2L signaling by death and decoy receptors. American Association 
for the Advancement of Science. Philadelphia, PA, February 1 998. 

19. Apo2 ligand and its receptors. American Society of Immunologists. San 
Francisco, CA, April 1998. 

20. Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA, 
May 1998. 

21. Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA, . 
. \CA, June 1998. 

22. Apo2L as a potential therapeutic for cancer. Gordon Research Conference on 
Cancer Chemotherapy. New London, NH, July 1998, 

23; Control of apoptosis by Apo2L. Endocrine Society Conference, Stevenson, WA, 
August 1998. 

24. Control of apoptosis by Apo2L. International Cytokine Society Conference, 
Jerusalem, Israel, October 1998. 
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25. Apoptosis control by death and decoy receptors. American Association for 
Cancer Research Conference, Whistler, BC, Canada, March 1 999. 

26. Apoptosis control by death and decoy receptors. American Society for 
Biochemistry and Molecular Biology Conference, San Francisco, CA, May 1999. 

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH, June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research . 
Conference, Alexandria GA, Aug 1999. 

29. Safety and anti-tumor activity of recombinant soluble Apo2L/TRAIL. Cold 
Spring Harbor Laboratory Symposium on Programmed Cell Death.. Cold Spring 
Harbor, NY, September 1999. \ 

30. The Apo2L/TRAIL system: therapeutic potential. American Association for 
Cancer Research, Lake Tahoe, NV, Feb 2000. 

3L Apoptosis and cancer therapy. Stanford University School of Medicine, Stanford, 
CA, Mar 2000. 

32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine, 
Philadelphia, PA, Apr 2000. 

33. Apoptosis signaling by Apo2L/TRAIL. International Congress on TNF. 
Trondheim, Norway, May 2000. 

34. The Apo2I/TRAIL system: therapeutic potential. Cap-CURE summit meeting. 
Santa Monica, CA* June 2000. 

35. The Apo2L/TRAIL system: therapeutic potential. MD Anderson Cancer Center. 
Houston, TX, June 2000. 

36. Apoptosis signaling by Apo2L/TRAIL. The Protein Society, 14 th Symposium. 
San Diego, C A, August 2000. 

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN 
Aug 2000. 

38. Apoptosis signaling and anti-cancer potential of Apo2L/TRAIL. Cancer Research 
Institute, UC San Francisco, CA, September 2000. 

39. Apoptosis signaling by Apo2L/TRAIL. Kenote address, TNF family 
Minisymposium, NIH. Bethesda, MD, September 2000. 

40. Death receptors: signaling and modulation. Keystone symposium on the 
Molecular basis of cancer. Taos, NM, Jan 2001 . 

41. Preclinical studies of Apo2UTRAIL in cancer. Symposium on Targeted therapies 
in the treatment of lung cancer. Aspen, CO, Jan 2001. 
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42. Apoptosis signaling by Apo2L/TRAIL. Wiezmann Institute of Science, Rehovot, 
Israel, March 2001. 

43. Apo2L/TRAIL: Apoptosis signaling and potential for cancer therapy, Weizmann 
Institute of Science, Rehovot, Israel, March 2001. 

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease 
conference, North Falmouth, MA, Jun 2001. 

45. Targeting death receptors in cancer with Apo2IVTRAIL. Biotechnology 
Organization conference, San Diego, CA, Jun 200L 

46. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Gordon Research 
Conference on Apoptosis, Oxford, UK, July 2001. 

47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic 
Foundation, Cleveland, OH, Oct 2001. v 

48. Apoptosis signaling by death receptors: overview. International Society for 
Interferon and Cytokine Research conference, Cleveland, OH; Oct 2001 . 

49. * Apoptosis signaling by death receptors. American Society of Nephrology 

Conference. San Francisco, CA, Oct 200 L 

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San 
Diego, CA, Apr 2002. 

51. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Kimmel Cancer 
Research Center, Johns Hopkins University, Baltimore MD. May 2002. 

52. Apoptosis control by Apo2L/TRAIL. (Keynote Address) University of Alabama 
Cancer Center Retreat, Birmingham, Ab. October 2002. 

53. Apoptosis signaling by Apo2L/TRAIL. (Session co-chair) TNF international, 
conference. San Diego, CA. October 2002. 

54. Apoptosis signaling by Apb2L/TRAIL. Swiss Institute for Cancer Research 
(ISREC). Lausanne, Swizerland. Jan 2003. 

55. Apoptosis induction with Apo2L/TRAIL. Conference on New Targets and 
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003. 

56. Apoptosis signaling by Apo2L/TRAIL. Hermelin Brain Tumor Center 
Symposium on Apoptosis. Detroit, MI. April 2003. 

57. Targeting apoptosis through death receptors. Sixth Annual Conference on 
Targeted Therapies in the Treatment of Breast Cancer. Kona, Hawaii. July 2003. 

58. Targeting apoptosis through death receptors. Second International Conference on 
Targeted Cancer Therapy. Washington, DC. Aug 2003. / 

Issued Patents: 
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1. Ashkenazi, A., Chiamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,329,028 (Jul 12, 1994); 

2. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,605,791 (Feb 25, 1997). 

3. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 5,889,155 (Jul 27, 1999): 

4. Ashkenazi, A., APO-2 Ligand. US patent 6,030,945 (Feb 29, 2000). 

5. Ashkenazi, A., Chuntharapai, A., Kim, J., APO-2 ligand antibodies. US patent 6, 
046, 048 (Apr 4, 2000). 

6. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,124,435 (Sep 26, 2000). 

7. Ashkenazi, A., Chuntharapai, A., Kim, J., Method for making monoclonal and cross- 
reactive antibodies. US patent 6,252,050 (Jun 26, 2001). 

8. Ashkenazi, A. APO-2 Receptor. US patent 6,342,369 (Jan 29, 2002). 

9. Ashkenazi, A. Fong, S., Goddard, A., Gumey, A., Napier, M., Tumas, D., Wood, W. 
A-33 polypeptides. US patent 6,410,708 (Jun 25, 2002). 

10. Ashkenazi, A. APO-3 Receptor. US patent 6,462,176 Bl (Oct 8, 2002). 

11. Ashkenazi, A. APO-2LI and APO-3 polypeptide antibodies. US patent 6,469, 1 44 B 1 
(Oct 22, 2002). 

12. Ashkenazi, A., Chamow, S. and Kogan, T. Carbohydrate-directed crosslinking 
reagents. US patent 6,582,928B1 (Jun 24, 2003). 
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DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

i. I was awarded a Ph.D, by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2 I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1 999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
bom the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 

— significantly Wgher levels-man-m ro^ we- 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
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DECLARATION OF PAUL POLAKIS, PUD. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has ' 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech' s Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely usefiil for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular iftRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 yeans of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human g;enes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell In feet, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood genera] rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and farther that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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SECOND DECLARATION OF PAUL FQLAKIS. Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I am currently employed by Genentech, Inc, where my job tide is Staff 
Scientist. 

2. Since joining Genewech in 1999, one of my primary responsibilities has 
been leading Geneniech's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers rhar find use 
as targets for both the diagnosis and treatment of cancer in humans. 

3. As I stated in my previous Declaration dated May 7, 2004 (attached as" 
Exhibit A), my laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal ussue(s); 

4. In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To dare, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B, In Exhibit B, means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

5. As shown in Exhibit B, of the 3 1 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level T 28 of them (ue., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 




6. Based upon my own experience accumulated in more than 20 years of 
research, including the daca discussed in paragraphs 4-5 above and my 
knowledge of ihe relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative to 
the normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 
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EXHIBIT A 



DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows; 

1 . I was awarded a PhD. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job tide is Staff 
Scientist. Since joining Genentech in 1999, one of ray primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundandy expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of die research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used teclinique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed trom these differentially expressed'genetranscripts^nd^avc used-tliese — 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells.. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protpin expression analyses described in paragraphs 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

\ 6. Based upon my own experience accumulated in more than 20 years of 
research, including thedata discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
. corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willfiil false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 



By: 

Paul Potakis, PhD. 



Dated : S/oVo? 
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GENE EXPRESSION PATTERNS IN BREAST CANCER 



Table 1 Summery of independent amplicons in 14 breast cancer cell lines by 
CGH microarray 



Location 
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CGH were validated, with lq21, J7ql2.—q2L2, 17q22-q23, 20ql3.1, 
and 20ql3.2 regions being most comirnonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identifie-d at 9pl3 (38.65-39*25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression «3ata on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl J— pJ2 (Fig. 3-4). In BT-474, the two known amplicons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 3#). In addition^ several genes, including the 
homeobox genes HOXB2 and HOXB 7, were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3 B, inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, andZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP1I-361K8 was la- 
beled with SpectrumOrange (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGrecn- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalhvfixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the N1H. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and VCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ng of mRNA 
as a template. HOXB7 primers were 5 '-G AGC AG AGGG ACTCGGACTT-3 ' 
and 5'-GCGTCAGGTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. IB). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1 ,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profile*. A, genes in the 
7pl 1 -pi 2 ampltcon in the MDA-468 cell line are highly expressed {red dots) and include 
the EGFR oncogene. B, several genes in the 17ql2. I7q2l .3. and I7q23 amplicons in the 
BT-474 breast cancer cell line are highly overexpressed {red) and include the HOXB7 
gene. The data labels and color coding arc as indicated Tor Fig. 2C. Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (grre/t) to MDA-468 (A) and HOXB7-iptc\f\c probe {red) and chro- 
mosome 17 centromere (green) to BT-474 cells (B). 
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Copy number ratio 



Fig. 4. List of 50 genes with a suusu^ly 
significant correlation (a value <0.O5) between 
gene copy number and gene expression Name 
chromosomal location, and the a value for each 
gene we indicated. The genes have beenordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
?70 genes whose expression was significantly influenced by copy 
number across all i4 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) arc implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
thatcould not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in > 1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somaric genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



• Internet address: htrp:/;www.gcneontology.or^. 



° Internet address: httpV/www. ncbi.nlm.nih.gov/cn1rc2. 
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level copy number increase. Low4cvel copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more iiruportaiit than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, haye an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 

m Tte CG^mkrcOTay analysis identified 24 independent breast 
cancer amplicons. We defined the precise bouiidaries ^ ^am- 
nlicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the bomcobox gene region at 17q2L3 and led to the over- 
expression of the HOXB? and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexposing HOXB7 in breast cancer and suggest that 
HOXB? contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poorprognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing ~2% of all genes on the array), induing not only 
previously described amplified genes, such as HER-2, MYC t 
EGFR ribosomal protein s6 kinase, and AIB3, but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biologicaMnsights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12 000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer, and (c) identification of a set of 270 
aenes the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q2i.3 implicated amplification and overexpression of the 
HOXB 7 gene in breast cancer, including a clinical association 
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between HOXB 7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development 
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ABSTRACT 

Genetic changes underlie rumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed tbe use of comparative genomic hybridization (CGII) to analyze 
the pattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantiUte the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent ampjlcons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and tow-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing ovcrexpression and 10.5% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including tbe HOXB7 gene, 
the presence of which in a novel amplicon at 17q2J*3 was validated In 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA mlcroarrays revealed hundreds of 
novel genes whose overexpression Is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDN A microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid rumors. Besides amplifications of known oncogenes, over 
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Fig. 1. Impact of gene copy number on global gene expression levels. A. percentage of 
over- and undcrcxprcssed genes (7 axis) according to copy number ratios (X axis). 
Threshold values used for over* and undcrexpression were >2.I84 (global upper 7% of 
the cDNA ratios) and < 0.4826 (global lower 7% of the expression ratios). B, percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >1.5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: {a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes whose mRNA expres- 



5 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridisation; RT-PCR. reverse transcription- PCR. 
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Fin. 2 Genome-wide copy number and expression analysis in the MCF-7 breast cancer cell line. A. chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue 
line\ across the entire «nomc from lp telomere to Xq telomere is shown along with ± I SD {orange lines). The black horizontal tin* indicates a ratio ofl .0; red line, a ratio of 0.8; 
and ereen tine, a ratio of 1 2 &~C. gcnome-wlde copy number analysts in MCF-7 by CGH on cDNA microarray. The copy nnmbcr ratios were plotted as a function of the position 
of the cDNA clones along the human genome. In individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal Une.Jbe 
«mv number ratio of I 0 In C. individual data points are labeled by color coding according to cDN A expression ratios. The bright red dots indicate the upper 2%. and dark red dots. 
mTwxt5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowtst 2%, and dark green dots, the next 5% of the expression ratios 
(underexpiessed genes); the rest or the observations are shown with Woe* crosses. The chromosome numbers are | shown at the bottom of the figure, and chromosome boundancs are 
indicated with a dashed line. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND IVJETHODS 

Breast Cancer Cell lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCCI428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (11-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 >tg of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Alu\ and Rsa\ (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
^g of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization ( 14, 1 5) and 
posthybridization washes (13) were done as described For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty \i% of reference RNA were 
labeled with Cy3-dUTP and 3.5 *xg of test mRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, toe ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spoiled four times onto the 
array. Low quality measurements (i.e.. copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and'or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio >l.43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and gene? with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled ) for amplification (ratio, > 1 -43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (1). Wc calculated a weight, Wg, for each gene as follows; 

_ ~ °V> 
w t " <r gl + a# 

where m gXi o* ffl and m^, o^ denote the means and SDs for the expression 
levels for amplified and nonamphfied cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amp li con Mapping. Each 
cDNA clone on the microarray was assigned to a Unigcne cluster using the 
Unigene Build HI. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldcnPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



* Internet address: http7/rc«arcrunhgri.nU^.gov/miCToarray/downloadable_cdna.html. 
7 Internet address: www.gcnomc.ucsc.edu. 
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identifying the starting position of the b«t and longed i^t A of 
any DNA sequence represented in the corresponding UmGene 
duster (10) against the "Golden Path" genome assembly 
(http://genome^csc.edu/; Oct 7, 2000 Freeze). For UmGene 
dusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for ail elements representing the same UmGene 
duster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (Le. reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
Renes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the un- 
proved spatial resolution of array CGH, we ordered (1 luores- 
cence ratios for) the 6,691 cDNAs according to the p Golden 
Path" (http://gCTome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 
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deletion. Paralld analysis of DNA from cell lines cwrtaimng 
different numbers of X chromosomes (Fig, 16), as we did before 
(i\ demonstrated the sensitivity of our method to detcctsmgle- 
Sy loss (45, XO). and 13- (47,XXX). ^ 
2jlfold (49,XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorcscwice 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig la), detected m the turoore desirite tte i>resew 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gam and loss were readily iden- 
tifiable. For example, gains within Ig, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell toes/tumors 
(90%/69%, 100%/47%, 100%/60%, 1^^^^" 
hr), as were losses within lp, 3p, 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectively), consistent 
with published cytogenetic studies (refs. £-4; a complete : listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade (P ~ 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P ~ 0.04), and harboring TP53 mutations (P « 
0.0006) (sec Table 4; which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome S\ which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 7b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 1 7 and 20, can b e 
found in Figs. 6 and 7, which are published as supporting 
information on the FN AS web site). ' . _ 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively); and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; ix., a significant 
fraction of highly amplified genes appear to be corresponding/ 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 

"lb determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 
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breast cancer cell lines and tumors* average mRNA levels 
tracked with DNA copy number across alt five classes, in a 
statistically significant fashion (P values fw ^^^d«it s 
r tests comparing accent classes: ^ ^* JL,' 
5 x 10-*. 1J k 10- 2 ; tumors, 1 x 10" 43 , 1 X W^5x 10 -\ 
1 X 10-*). A linear regression of the average tog(DN A copy 
number), for each class, against average k>g(m 1W A levd) 
demonstrated that on average, a 2-fold change m DNA copy 
number was accompanied by 1.4- and 13-foJd changes m mRNA 
level for the breast cancer cell lines and tumors, respecuvery jt-jg. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normaWiaped curve, 
but with the peak markedly shifted in the positive direction from 
zero This shift is statistically significant, as evidenced m a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
iene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig 46) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig. 4. Genome-wide influence of DNA copy number .Iterations on mRNA levels. (a) for breast cancer ceO lines (gray) and tumor samples (black), both 
mean-centered mRMA fluorescence ratio (logj scale) quartDes (box plots indicate 25th, 50th. and 75th percentile) and averages (diamonds; V-vahie error ban 
Indicate standard errors of the mean) are plotted for each of five classes of genes, representing DMA deletion (tumor/normal ratio < 0-8), no change (OS-1.2). 
low- 0.2-2), medium- (2-41 and high-level (>4) amplification. P values for pair-wise Student's t tests, comparing averages between adjacent classes (moving 
left to right), are 4 x 10-* 1 x 10~*» S X 1CT* 1 x 10-*<ceU lines); and I x 10-* t X 10-»* 3x 1<H\ 1 X 10^ (tumors), (b) Distribution of correlations between 
DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples. (£> Hottrf observed versus expected correlation coefficients. 
The expected values were obtained by randomization of the sample labels m the DMA copy number data set The One of unity is indicated, id) Percent variance 
in gene expression (among tumors) directly explained by variation in gene copy number. Percent variance explained (black line) and fraction of datarctained 
(gray line) are plotted for different fluorescence intemity/background (a rough surrogate for slgnal/nobe) cutoff values. Fraction of data retained* relative 
to the 1.2 Imensrty/background cutoff. Details of the linear regression model used to estimate the fraction of variation in gene expression awfcutaWe to 
underlying DMA copy number alteration can be found in the supporting information (see UthmtSno the fraction of Variation tn Gene Expression Attributable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
1% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. Ad). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. This still undoubtedry 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a scries of human breast tumors demonstrates the 
usefulness of defining ampticon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring ampltcon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we arc likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et aL (15) recently reported that in metastatic 
colon rumors only —4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 
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Contributed by Patrick O. Browa August 6, 2002 
Genomic DNA copy number alterations are key genetic events In 
the development and progression of human cancers. Here we 
report a gerc>m^wkte mkroatray consparative genomic W>no- 
Uation (array CGH) analysis of DNA copy number variation m 
a series of primary human breast tumors. We have profited W*A 
copy number alteration across 6,691 mapped human genes. In 44 
predominantly advanced, primary breast 
cancer cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies^the high- 
resolution (gene-by-gene) mapping of ampKcon boundaries and 
the quantitative analysis of ampllcon shape provide significant 
Improvement in the localization of candidate oncogenes. Parattei 
microarray measurements of mRNA levels reveal the remarltable 
degree to whkh variation In gene copy number contributes to 
variation in gene expression in tumor cells. Spedf lea ly. we find 
that 62* of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number Influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-foW change in DNA copy number Is associated with a corre- 
sponding 1.5-f old change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors Is directly attributable to underlying variation In gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1). have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
51 FGFR1 (8pll), MYC (8q24), CCND1 (Uql3) t ERBB2 
mom and ZNF217 (20ql3)J and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)], the relevant gene(s) within 
otifer regions '(e.g., gain of lq, 8q22. and 17q22-24. and loss of 
8p) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel m 
the same samples (8). using the same DNA microarrays. we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 
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this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transmptional 
programs of breast tumors. ' : ■ ' ■ ;, : ■■ J _ ; 

Materials and Methods 
Tumors and C*D lines* Prim^ 

large (>3 cm), intermediate-grade, infihratmg ductal carcino- 
mas, with more than 50% being lymph node P°^vj^ 
fraction of tumor cells within sr*ecirnens averaged at least 50%. 
Details of individual tumors have been published (8, 9), Mid 
axe summarized in Table 1. which is pubKshed as supporting 
information on the PNAS web she, ^www.pnasxirg. Breast «ncer 
cell lines were obtained from the American TVpe Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform, extraction 
followed by etbanol precipitation. 

DNA Labeling »nd Microarray HylwrHixat^ Genomic DNA label- 
ing and hybridizations were performed equally as described 
in Pollack ; * aL (7), with slight modfficatoons, ^.^crograxns 
of DNA was labeled in a total volume of 50 J?^***™*™ 
volumes of all reagents were adjusted accordtn^y. TtttDNA 
(from tumors and cell lines) was fluorescently labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (Le, UniGene clusters). The 
"reference" (labeled with Cy3) for each bybndization was nor- 
mal female leukocyte DNA from a single donor. Tte fabncaUon 
of cDNA microarrays and the labeling and hybnthzaUon of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were ^canned 
on a OenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable, DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted ^as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 



Abbreviation: CGH. compamthre genomic hybridtoattan. 

*, ~* ™«*t r»«t>«ts ihooW be addressed *t Department of Pathology. StarrfonJ 

U^S^S^RBU.^ *»0~pu, Dr>~ S Wo*. 

CA 94305-5176. E-mail: poUack10rtarrford.edu. 

—Present address: 2yomy» Itk, Maywa/d. CA 

PNAS | October 1.2002 | vol.99 | no. 20 | 129W-12W* 



studies. For example, the study of Platzer et aL (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression ur metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion' of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gcnc(s) 
within amplicons (16)), on a large number of samples, m addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
mteroarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and/by extension, the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number* Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18). beyond the anrolifkation of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochioroetric relationships in cell metabolism and physiology 
(e.g., proteosomc, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in easeer— 
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HER-2/neu Breast Cancer Predictive Testing 
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Each year, over 1 82,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease- free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(1HC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, Path Vys ion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the rumor, 
enabling differentiation between low- versus high-amplifica- 
tidh. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); .this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (< 1 . cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
bascd therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients . 

Selection of patients for Herceptin 0 (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin 0 in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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PT code information 

HER-2/aeu via IHC 

88342 (including interpretive report) 

HER-2/neu via FISH 

88271 x2 Molecular cytogenetics, DNA probe, each 
88274 



88291 



Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
Cytogenetics and molecular cytogenetics, interpre- 
tation and report 



Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest 0 . The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for.routinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion ©f the subse- 
quently added chromogen results in formation of visible 
\ reaction product at the antigen site, the specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at 1 7q 1 1 .2- 1 2 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologues normally present or an increase in the number of 
chromosome 1 7s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vy sis-certified Cytogenet- 
ics laboratory at SHMC. 



References 

1 . Wingo, RA„ Tong. T„ Bolden, S., "Cancer Statistics", 1995:45:1 :8-31. 

2 "Cancer Rates and Risks", 4* ed., National Institutes of Health, National 
Cancer Institute, 1996, p. 120. 

3 Slamon, D J„ Clark, G.M., Song, S.G., Levin, WO, Ullrich, A, McCutft, 
WJL "Human breast Cancer Correlation of relapse and survival with 
amplification oftheher-2/neu oncogene". Science, 235:177.182, 1987. 

4 Xing, WX, Gilchrist, K.W., Harris, CR, Samson, W„ Meisner; L.F. 
"FISH detection of HER-s/neu oncogene amplification in early onset 
breast cancer". Breast Cancer ftes. And Treatment 39(2):203-2 12, 1 996. 

5 . Press, M.F. Bernstein, L., Thomas, P.A., Meisncr, L.F., Zhou, J.Y>, Ma, 
Y., Hung, G., Robinson, R.A., Harris, C, El-Naggar, A„ Slamon, DJ., 
Phillips, R.N., Ross, J.S., Wohnan, S.R., Flom, KJ., "Her-2/ncu gene 
amplification characterized by fluorescence in situ hybridization; poor 
prognosis in node-negative breast carcinomas", J. Clinical Oncology 
15(8):2894-2904, 1997. 



Provided for the clients of 

Pathology associates Medical Laboratories 
PacLab Network Laboratories 

Tri-Cities Laboratory 
treasure Valley Laboratory 

For more information, please contact 
your local representative. 



€> 1999 by Pathology Associates Medical Laboratories. 



4050 Medical Sciences: Konopka et ai 

supplemented with 5% dialyzed feUd bovine 
were then resuspended in 2 ml of the minimal medium. 
Coding was started with the addition of Ptt^ 
phate (1 mCi/ml; ICN; 1 Ci - 37 GBq) and continued at 37 C 

itaUons were carried out as described (10). Cells (1.5 x jo j 
were washed with P hosphate : buffercd ^ «<™^ 
with 3-5 ml of phosphate lysis ^0*^^7 5/ 
NaDodSO</0.5% deoxycholate/10 mM Na 3 HP0 4( pH 7.5/ 
100 mM NaCl) with 5 mM EDTA and 5 mM phenylmethyl- 
fluoride. Extracts were clarified by ce^tnfu^tion 
and precipitated with normal or rabbit anU-abl s era i (and- 
pEX-2 or anti-pEX-5) (17). The precipitated proteins ^were 
electrophoresed in a NaDodSO</8% polyacrylarnide gel. 
"P-labcled proteins were detected by autoradiography. 
Alternatively, abl proteins were detected by immunoblW 
Extracts from unlabeled cells were clarified, > and proton* 
were concentrated by immunoprecipttation with rabb t arn> 
sera against aW-encoded proteins (antt-pEX-2 and an"-pEX- 
5 combined a7)] and then fractionated in 8% acrylamide gels, 
Sroteins were transferred from the gel to i mtro«nulose 
filters, using protease-facilitated transfer (18). The aW- 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and peroxidase-comugated goat ana- 
mouse second stage antibody (Bio-Rad) ^r^flopM 
Rabbit antisera and mouse monoclonal antibodies to abl 
proteins were prepared using ^ctcrially expressed ^ 
[he v-abl protein as immunogens (17. 19). Anti-pEX-2 anti- 
bodies react with the internal tyrosine kinase domain and 
anti-pEX-5 antibodies react with the carboxyl-terminal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10 s cells by the 
NaDodSCVurea/phenol method (20). Polyadenylylated 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a \% agarose/formalde- 
hyde gel and transferred to nitrocellulose, obi RNA species 
were detected by hybridization with a nick-translated v-cW 
fragment probe (21). f 7 f 

DNA Analysis. DNA was prepared from 5 x 10 7 cells of 
each cell line and processed for Southern blots with a -v-aW 
probe as described (21). 

RESULTS 

Variable Levels of P21<T* M Are Detected In PhNPosltive Cell 
Lines. Ph^positive and Ph'-negative, Epstein-Barr virus- 
transformed B-lymphocyte cell lines derived from the same 
patient were examined for P210^ w synthesis by irnniuno- 
precipitation of pP]orthophosphate-labeied cell exacts 
with anti-abl sera (Fig. 1). The normal c-aW protein P145 
was detected at a similar level in multiple Prepositive and 
Ptf-negative cell lines. P210~ bt was only detected in the 
Ph'-postuvc cell lines because the bcr-a W chimeric ^gene 
which encodes P210 c resides on the Ph l (4, 5, 11, 13). The 
level of P210 cibl was about 4- to 5-fold higher than the level 
of pi45~bi irl tbc S K-CML7Bt-33 cell line (Fig-M, +). The 
Ph ^positive erythroid-progenitor ceU line K562 (C) showed 
a level of P2i0 c • Ib, about 10-fold higher than P145 c - W . 
However, the level of P21<r~ bl was about onefifth that of 
p 145c .»bi in lh? p^-positive SK-CML16BM ceil hne (Fig. lB t 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P2Kr~w varies over a 
20-fold range between these Prepositive B-cell lines. Anal- 
ysis of four additional Ph l -positive B-ce)l lines demonstrated 
that the level of P210 c ** w fell into two general Classes; some 
cell lines had a level of ¥2W^ 1 similar to SK-CML7Bt-33 
and others had the low level similar to SK-CML16Bt-l (Table 
1). This differs from previous studies with Ph'-positive 
myeloid cell lines and patient samples derived from acute- 
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Fio. 1. Detection of variable levels of P210~ M in Ph'-positive 
B-cell lines. Production of P145~*» and P210" M in EP^in-Barr 
virus^transfortned B-cell line* fierived from a blast<nsls i (A) and 
chronic-phase (B) CML paUent was examined by metabolic labeling 
with [»P]orthopho3phate and imraunoprecipitalion. Ph'-negative 
(-) and Pb l -P08itive (+) cell lines derived from each paUent were 
analyzed. The Ph'-negati ve cell line in A,- is SK-CML7BN-2 and w 
is SK-CML16BN-1. The Ph l -positive ceU line in A,+ is 
SK-CML7BI-33 and in is SK-CML16BM. The K562 cell line, a 
^positive erylhroid progenitor cell line soonUneously derived 
froma blast-crisis patient (33), is represented in C. Cells i (1.3 x \V) 
were metaboUcally labeled with 2 mCi of P l P)orthpphospbate for 3-4 
hr and then were extracted and clarified by centrifugation. Samples 
were immunoprecipitated with control norma! serum (Janes 1). 
anti-pEX-2 Oanes 2), of anti-pEX-5 Canes 3) and analyzed by 
NaDodS0 4 /8% PAGE followed by autoradiography with an mtcn- 
sifying screen (3 days for A and C t 10 days for fl). 

phase CML patients, in which P21<T lW was detected at a 
10-fold higher level than ?U5* M (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNAand 
P2joc^w expressed in four myeloid/erythroid-lineage Ph - 
positive cell lines (K562, EM2 r EM3, CML22 ( and BV173; 
refs. 9 and 11). despite a 4- to 5-fold amplification of 
aM-related sequences in the K562 cell line. 

Detection of different levels of P210 wibl in Fig, 1 could be 
due to decreased phosphorylation of P210 c ^ bl t a lower level 
of P210 MW synthesis, or altered stability of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210 c ** bl in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Bt?33 (Fig. 2A, +) 
had a higher level of P210 mW than P145, similar to the results 
with metabolic labeling (Fig. I). We did not detect P210* 1 
by immunoblotting with 2 x 10 7 cells of line SK-CML8Bt*3 
(Fig. 2B, +). Reconstruction experiments using dilutions of 
cell extracts showed that we could detect about 5-10% the 
level of P210^ bl expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210 c ~* w in 
SK-CML8Bt-3 is lower than the level in SK-CML7Bt-33 by 
a factor of at least 10. The level of P210~ bl detected in these 
assays correlated with the amount of P210*- abi tyrosine kinase 
activity that could be detected in vitro (data not shown). 

Different Levels of P210^ M Are Reflected In the Amount of 
Stable bcwbl mRNA. To identify the basis for detection of 
variable levels of P210 c - W , we examined the production of 
the abl RNA. RNA blot hybridization analysis using a v-cW 
: probe (Fig. 3) showed that the normal 6- and 7-kb c-abt 
raRNAs were present at a similar level in Prepositive and 
-negative cell lines derived from different patients. However, 
the 8-kb mRNA tbat encodes P210 c * w was detected at a 
10-fold higher level in SK-CML7Bt-33 (Fig. 3A, +) than in 
SK-CNfl-16Bt-l (B, +), which correlated with the relative 
level of P210 Mlbl detected in each cell line. Analysis of 
additional cell lines demonstrated that the level of 8-kb RNA 
directly correlated with the level of P210 c ' bl (Table 1). The 
variation in level of 8-kb RNA detected in these ceil lines was 
not due to loss or gain of Ph l ( because cytogenetic analysis 
confirmed the presence of Ph 1 in these cell lines (ref. 16 and 
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Tabic 1. Relative levels of bcwbl expression in EpsteiprBaix 
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Cell line* 


CML phase* 


Ph** 


P210* 


8-kb 
mRNAV 




BC 








SK-CML8BN-10 


Chronic 








SK-CML8BN-12 


Chronic . 








SK-CML16BN-1 


Chronic 








SK-CML35BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ 


+ + + 


+++ 


SK-CML21BM 


Acc 


* 


+ + + 


+++ 


SK-CMUlBt-6 


Acc 


+ 


++'+ 


+++ 


SK-CML8Bt-3 


Chronic 


+ 


+ _ 


± " 


SK-CMU6BM 


Chronic 


+ 


+ 


+ 


SK-CML35Bt-2 


. Chronic 


+ 


■ + 


+ 


K562 


BC 


+ • 


+++++ 


+++++ 


BV173 


BC 


+ 


+++++ 


++++ + 


EM2 


BC 


+ 


++ + + + 


+++++ 



•Cell lines denvea irom uwl p»ucm».v/ yo»«w»«.™ T » 
Bpstein-Barr virus as described (16). Names of cell Unes indicate 
patient number and Pb l status: SK-CML7Bt indicates a cell line 
derived from patient 7 that carries the 9;22 Ph> translocation; N 
indicates a normal karyotype, Myeloid-erythroid ecu lines (K5W, 
. EM2, and BV173) are described in previous publications (9, 11, 22. 

tStatus of patient at the time cell line was derived. BC, blast crisis; 
Acc, accelerated phase. 

tpresence (+) or absence (-) of W as demonstrated by karyotype 
or Southern blot analysis. ', 

iKIO** 1 " detected as described in legend to Fig. 1. B-ceU Unes 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+++) than levels of P145. Chronic- 
phase-derived cell tines had P210 levels lower than or just equivalent 
(+) to the level of P 145. Myeloid and crythroid lines had levels of 
P2J0 5- to 10-fold higher than P145 (+++++). 

tEight-kilobase bcr-abt mRNA delected as described in legend to 
Fig! 2. Symbols: *, borderline detectable; ■*■++*■+• ,ev ?* of *^ 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c-aWmRNA 
species; +++. level of 8-kb mRNA 3- to 5*fold higher than that of 
the 6- and 7-kb species; + , a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of aM-related sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of abl sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 c -° w detected. This could be mediated 
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Fio 2 Analysis of steady-state abl protein levels by immuno- 
blotung/Ccll extracts prepared from **J&J^ 
CML7BN-2 (A SK-CML7Bt-33 (A,+), SK.-CML8BN-10 (fi. T ), 
and SK-CML8Bt-3 (B.+) were, concentrated by immunoprecip- 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then electro- 
phoresed in a NaDodS0 4 /8% polyacrylamidc gel and transferred to 
nitrocellulose, using protease-facilitated transfer (18). abl protems 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 oW-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-cotyugated goat anti-mouse 
second-stage antibody (Bio-Rad) for development. 
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Fio. 3. Comparison of aW RNA levels to Fh'-positive and 
-negative B-ceti lines. The levels of the normal 6- and 7-kb c-abl 
RN As and the 8-kb bcr*bl RNA were analyzed by blot hybridization 
using a v-«W probe. RNA was extracted from Ph^negative Unes 
SK-CML7BN-2 (A,-) and SK-CML16BN-1.(J?,-). from Ph l -pos- 
itive lines SK-CML6BI-33 U,+) and.SK-CML16Bt-3 and 
from line K562 (C,+) by the NaDodSCVurea/pheooi method (20). 
Polyadenylylated RNA was purified by oUgo(dT) affinity chroma- 
tography, and 15 Mg of each sample was electrophoresed in a 1% 
agarose/formaldehyde gel and then transferred to nitrocellulose.The 
blotted RNAs were hybridized with amckrtranslated v-oW fragment 
probe (21) and then autoradiographed for 4 days. 

by factors influencing the transcription rate of the bcr-abl 
gene or the stability, of the mRNA; 

DISCUSSION 

Several Unes of evidence suggest that formation of Ph* is not 
* the primary event that affects the stem ceil in CML. Patients 
have been identified that present with the cUriical picture of 
CML but only later develop Ph 1 (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-hetero2ygous females with CML that demon- 
strate stem-cell clbnaUty by isozyme analysis among cell 




Flo. 4. Southern blot analysis of abl sequences in Ph'-positive 
and -negative B-ceU Unes. High molecular weight DNA (15 MK) was 
digested with restriction endonucleasfc BomHl, separated in a Q.89& 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick-translated, 2 .4-kb Bgl II v-o W 
fragment (1.5 x 10« cpm/w ref. 21) and exposed for 4 days. (A) 
Autoradioeram of ^specific fragments in cell lines HL-60 (lane 1), 
EM2 (lane 2), K562 (lane 3). SK-CML7Bt-33 Oanc 4) SK-CMUBt-3 
(lane 5). SK-CML16BM (lane 6), SK-CML21Bt-e Cane 7] I. SK- 
CML35Bt-2 (lane 8), SK-CML7BN-2 (lane 9), SK-CML8BN-2 (lane 
10), and SK-CML35BN-1 (lane 11). (fl) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24. 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph\ a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 

disease. , , ™-i • .t. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the v-abt gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). . . 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-aW 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results In a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of rranj-acting factors that occur during evolution of 
the disease. Our analysis of P210 c ' lbI and the &-kb mRNA in 
Epstein-Barr virus-transformed Ph l -positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c -* b * one-fifth that of P145**", as detected by metabolic 
labeling with [ 32 P]orthophosphate and immunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases; relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c ** bI expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Pb^positive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c » bi in those 
derived from patic v. • at more advanced stages of the disease. 
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It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph*. 
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ABSTRACT Toe consistent cytogenetic translocation .of 
chronic myelogenous leukemia (the phtiadelphla chromosome, 
Ph 1 ) has been observed In cells of multiple hematopoietic 
lineages. Tins translocation creates a chimeric gene composed 
of breakpolnt-cluster-region {bcr) sequences from chromosome 
22 fused to a portion of tjte abl oncogene on chromosome 9. The 
resulting gene product (TCIO**** 1 ) resembles the transforinutg 
protein of the Abelson murine leukemia virus In Its structure 
and tyrosine kbVase pctivtty. PlNr - * 1 Is expressed to Ph% 
positive cell lines of myeloid lineage and to dinlcal specimens 
with myeloid predominance. We show here that Epsteln-Barr 
virus-transformed B -lymphocyte lines that retain Ph 1 can 
express Y2\V >M . The level of expression In these B-cell lines Is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph* template. 

Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abt gene on chromo- 
some 9 into the middle of the breakpoint-cluster region {bcr) 
gene on chromosome 22 (3-6)/ Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar g-kilobase (kb) mRNA to 
each case (5-9). The hybrid kr-aW message encodes a 
structurally altered form of the abl oncogene product, called 
P210 e "* w (10-13), with an ammo-terminal segment derived 
from a portion of the exons of bcr On chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exohs of the c-abt gene on chromosome 9. The chimeric 
structure oibcr-abl and the resulting P210^ * bl is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P1W**" transforming gene product. 
Both proteins have very sjmilar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by thejx ATP requirements from 
the recently described tyrosine kinase activity of the c-abt 
gene product (15). 

The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "adverthement" 
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In concert with structural modification of the amino- 
tenninal portion of the abl gene, increased level of expression 
has been implicated in activation of c-abt oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and P210 c ' aW than 
the c-aW mRNA forms (6 and 7 kb) and P145 C ^ 1 gene product 
(5, 8, 9, li)r The higher level of ' expression of the chimeric 
bcfrabl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7^kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the. normal c-<»W messages (5, 6). 
- We have analyzed a series of JEpstein-Barr virus-immor- 
talized B -lymphoid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of fch 1 always results in synthesis of the chimeric 
bcr*abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
s.how that ceU lines that retain Ph 1 do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c ***^ suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-abl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report* 
ed (liS). The cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line ?3; and SK-CML7BN- 
2 refers to B-celi line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just, prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI.1640 
medium with70% fetal bovine serum. We have not observed 
any consistent pattern of In vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 



Abbreviations: bcr, breakpoint-cluster region; CML, chTonic 
myelogenous leukemia; kb, kilobase(s). 
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1 lulroduetioQ 

A proteomc has been defined as ibe'p'rotein complement 
expressed by the genome of an organism, oi; in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell U|. la the most common im- 
plementation of prorebme analysis (he proteins extracted 
from the cell or "tissue analyzed are separated by high 
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Proteomc analysis: Biological assay or data Archive? 

In this review we examine the current state of protcome analysis. TJcre we 
Re main issues discussed: why it is necessary to study proteomes; how pp. 
^tneTcan be analyzed with current technology; and how^proUome analysis 
can be used to enhance biological research. Wc conclude that protcome anal, 
wis is an essential tool in the understanding of regulated biological systems. 
Cunent technology, white still mostly limited to the more abundant proteins, . 
enables' the use of protcomo analysis both to establish databases of profeins 
treseoL and to perform biological assays involving measurement of multiple 
variable^ We believe lhat the utility of proteomc aoalyakin future* biological 
lesearchwiirepnlinut lo be enhanced by further improvements la .analytical, 
technology. . \ ' 

resolution two-dimensional gel- electrophoresis (2-DE), 
detected in the gel and identified by their amino acid - 
sequence. The ease, sensitivity and speed with which gel* 
separated proteins can be identified by the use of recently 
developed mass spcclrometric techniques havo dramall- 
cally increased the interest in proteomc technology. One. 
of the most allracU ve features of such analyses is thai com- 
: piex biological systems can potentially bo studied in their 
entirely, rather than as a multitude of individual compo- 
nents. This makes it far easier to uncover the many com- 
plex, and often obscure, relationships between mature 
gepe* products m cells: Lvge;scal6 -proteomc charatttnxa- 
tfon projects have been undertaken for a number of dif- 
ferent organisms and. cell types. Microbial protcome pro* 
. jects currently ip progress include, for example: SacAara- 
thyca cereviiiat ili Salmonella tntutco |3J, Sj/iroptetma - 
mtStytrum 14), Mycobacterium tuberculosis 151, Odirobxtc- 
trum anikropl'Vk Haemophilus irtfhietitae pJ,.^neoSo- 
cystis spp. |8J, £rd»erieA<a xqII [9\, Rhtzoblvm iegumtno- 
sarurn (10], and Dtctyaiittium dhcotdcum (III Proteomc 
projects underway Tor tissues of more complex organ- 
isms include those for: bumin bladder squamous cell 
carcinomas (12), human Uver [13], human plasma (13]; 
human teralinocytes (12), human fibroblasts 112), mouse 
kidney 1 12], and rat serum \Ul In this manuscript wo cri- 
tically assess the concept of proteomc analysts and the 
technical feasibility of establishing complete proteomp 
. mips, and discuss ways Jo which proteomc analysis and 
biological . research . Intersect. ' 



Z Rationale for proteomc analysis 
The dramatic growih in both ibe number of genoroo 
projects and the speed with which genome sequences 
ate being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences (US-I7D. The description of the 
state of a- biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent, technical 
advances including the development of differential dis- 
play-PCR (181, cDNA microarray and DNA chjp techno- 
logy [19, 20] and serial - analysis of gene expression 
(SAGE) (21. 221, II « now feasible to establish global and 
quanlitsliye mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is* known, at a 
speed and sensitivity which is not matched by current 

* oiD-oiiun/i 1 11-1161 Jl7.KK.iCrt> 
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proicin analysis technology. Given . Ihe long-stBDding 
paradigm in biology thai DNA synthesizes RNA which 
synthesizes protean, and the ability, to rapidly establish 
global, quantitative mRliA expression maps, Ibe ques- 
tions which arise are why (ethnically complex proteomc 
projects should be undertaken and what specific types of 
information could be expected from proteomc projects 
which cannot be obtained from gcobmic and transcript 
profiling projects. We see three main reasons for pro- 
teomc analysis to become an essential component In the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (u) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (Hi) proteomes 
arc dynamic and reflect the slate of a biological system. 

2.1 Correlation between mRNA and p rot tin expression 
lev«U 

Interpretations of quantitative mRNA expression profiles 
frequently Implicitly or explicitly assume thai for specific 
geoes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyca ccrevtstoc 
growing at mid-log phase (S. P. Gygt tt submitted for 
publication); mRNA expression levels were calculated 
from published SAGE frequency, tables \22l Protein 
expression levels were quantified by metabolic radiota- 
beling of the yeast proteioj, liquid scintillation counting 
of tbo protein spots separated by high resolution 2-DE 
and mass spectrometry identification of the prolein(s) 
migrating to each spot. The selected 80 samples coosli-. 
tutc a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general tread but no 
strong correlation between protein and transcript levels 
(Fig. 1). tit some gcocs studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
slate protein expression levels wcic maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
tt oL, submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

in the mature, biologically actiy'e forra rnany proteins arc 
post-tnnslationally modified by glycosylation, phosphor- 
ylation, prenylalion, acylation, ubiq urination or one or 
more of many other .modifications [23J and many pro- 
teins aic only functional if specifically associated or com- 
pleted wiib other molecules, including DNA, RNA. pro- 
teins and organic and inorganic cefaclors. Frequently, 
modifications we dynamic and reversible and may alter 
the precise three-dimensional structure and the slate -of 
activity of a proiein. Collectively, tKc stale of modifica- 
tion of (be pioteins which constitute a biological system 
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. are important indicators for the state of the system. The 
' type of protein rrtodification and the sites modified at a 
specific cellular state, can usually not be determined 
from the gene sequence alone, 

23 Proteomes are dynamic and reflect the stale of a 
biological system 

A.siogJe genome, can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
' growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular staies- 
which are characterized by significantly 'different pro*. 
Iconics. The proteome, in principle, also reflects events 
that are under treoslaUonal and posl-transUtional con- 
trol. It is therefore expected that protectants will he able 
to provide the most precise and detailed molecular des- 
cription of the slate of a cell or tissue, provided that the 
external conditions defining the slate are carefully deter- 
mined. In answer to the question of .whether the study 
of proteomes is necessary for. the analysts of biomolec- 
ular systems, it is evident that tho analysis of mature pro* 
tern products in ceils is essential .as there are numerous 
levels of control of protein synthesis; degradation, 
processing and modification, which- are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis, technology : * 

3.1 Tfechoicfll requirements of proteome technology 

In biological systems the level of expression as welt as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state- of the system: Comprehen- 
sive analysis of the identity, quantity and stale of modifi- 
cation of proteins iherefore requires the detection and 
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quantitation of the proteins which const lute Ihr s*l£ 
and analysis of UifTcrcnUalty proceed 
/number of inherent difficuUi« .n protein ana lysi 
which complicate these iwte.-Kisl. p~ cannot be 
itified. IHs possible to produce ^ 
psitohr protein by ovcr<* P rcsston m 
terns. However, since many, protems >rt 4*"^* 
pon-uansuiionatty modified, the y eanno be y an> 
plificd in the form in which they MlrM^%«" 
CSle-l system. It is fluent* ditfcultlo p« ftom 
the native source sufficient amounts of a 9™*!" 

.: analysis. From a technological point of V**^' 
late/ into the need for high sensitivity 
M. Second, many protein, are modtfico- q Pro- 
cessed P osMrar*laU*onally. Therefore *H add Uon to the 
protein identity, the structural basis *f OTe»l««r 
Edified isoforms also needs to be delerrnmcd. The dtf- 
IXion of a constant amount of protein over several 
differentially modified isoforrns further reduces the 
amount of each specie,, available for ?^ J* 
piexity and dynamics of post-translaUonal protein cdl 
mg thus significantly complicates proteomc siutws. 
Thud. P'Oteiiu vary dramatically with respect 10 the r 
solub my in convnonly used selvepts. Tbere are few f 
«y; solvent conditions in which all protein, art soluble 
and which ate also compatible With protein »^;™' 
makes the development of protein purmcation methods 
particularly difficult since bothprotcln purification pad 
solubility have to be achieved under the, same. condi, 
-Sou. Detergents, In particular sodium dodecyl sulfate 
(SDS). arc frequently added to aqueous * 
maintain protein solubility. Toe company witt . SDS 
is a big advantage of SDS. polyacrylamide geKeUctror:. 
□bo«sis (SDS-PACB) over other prolem separation 
^chniqucs. Thus. SDS-PAGB 
electrophoresis, which also uses SDS and oujcr deter- 

' «att are the. most general and preferred methods i far 
the purification of small amounts of proteins, provided 
that activity docs not necessarily need to be maintoined. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use method* which are as 
moid a* possible to allow completion ' of the project 
.within a reasonable ume* frame. Therefore, « successful, 
general proleomics technology requires wgb seruiUvily, 
high throughput, the ability to differentiate differentia^ 
modified proteins.* and the ability to quantitatively dis- 
play and analyze all the proteins present m a sample. 

3.2 2-D electrophoresis- nWs spectrometry: a common 

loipltmcDialloB of prcicome aealrsls 
Tnc most common currently used implementation of 
- Pfoieome aoilysis technology is based oo tK separation 
of proteins by two-dimensional O^SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS), in 2-Dfi, proteins arc first senar- 
atcd by isoelectric focusing (IEF) and then by SDS- 
PAGE in tbe second, perpendicular dimension. Scpar- 
eled proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimenstooaJ arrays of 
proteins. 2-DE gels are, at present,. the most commonly 
used means of glohil display of proteins in complex 
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samDltsVThe separation of thousands of proteins has 
o^n achieved* in a single gel 1**5! 
modified proteins are frequently separated; Due to the 
compatibility tf.*DB with high conceatrauooi.^f deter- 
gents, pmtein denaluranU and other additives prornoirng 
protein solubility, the technique . is widely used. 

Toe second step of this type of proleornc analysis - i» Up 
identification and analysis, of separatedproleins Individ- 
ua! proteim from polyacrylamide |tl? have UadiUonalfy 
been identified using //-terminal sequencing "I 
Internal peptide sequencing P*. 2% immunoMo«ing , or 
comigraiioo with known proteins |3Q1 The recent dra- 
matic grtmh or large-scale. genomic and wrpressed 
sequence ug (EST) sequence databases has resulted iryi 
fundamental change in the way proteins are identified |y 
their amino acid sequence. Rather than by tbe iradiUonbS 
methods described above;, protein sequences are now fre- 
quenUy determined by correlating mas spectral or 
Undern mass spectral data or peptides derived from pro- 
icins, with the- information contained in sequence data- 
bases pi-wi 

There are a number of alterriaiivc approaches to pro- 
teome analysis currently under development TUere U 
considerable inurest In developing ft pioleoma annlytis 
stra B egy which bypasses 2-OB altogether, because ii is , 
considered a relatively stow and tedious process, and 
because of perceived ditticulUcs in extracting proteins, 
from the gel matrix for analysis. However, 2-DE as a 
sia*ting point for prpteome analysis bos many .advan- 
tages compared to other techniques available today. Tbe 
most sigioificani -sirenglbs- of-tbe'2-DE-WS approach 
include the relatively uniform behavior of proteins In 
gels the ability to <mautu> spots and the hi$h resolution 
and simullaneous display of hundreds l© thousands of 
proteins within a reasonable .time frame; 

A schematic diagram of a typical procedure of the Identi- 
fication of gcl-separuted proteins is abown in Rg. 2. Pro- 
tein spots detected in the gel are enzyroaUcaJly or chemi- 
cally fragmented and the peptide fragments are Mated 
' for analysis, as already indicated, mast frequenUy by MS 
or US/HS. There are numerous protocols for the goner- 
ation or peptide fragments from ^et-separaled proteins. 
Tbcy can be grouped into two categories, digestion in 
the gel slice |28, 34) or digestion after elecuouansfer out 
of the gel onto a suitable membrane fJ29, 35-37] and 
reviewed in |38]). In most instance* 'cither technique is 
applicable and yields good -results. The analysis of MS or 
'MS/MS data is an important step in the whole process 
because MS insUjuments cap generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number *f groups have developed 
software systems dedicated lo the use of pepUde MS 
and MS/MS spectra for tl>e identification of .proteins. 
Proteins arc identified by correlating the information 
contained io the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in UNA or protein sequence databases. 

The systems we are cumnuyusing in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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fliwt Schemuk dinrwo of • procedure for idtnlitoioo of/ icK 
feptmed praicbu. PcpUtfti eao tilhci be npirated bj t ttdmiqw 
xucb u IC or CB. of iftfuwt « * mature sad. sorted *» the MS. D»t»- 
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rephy (39, 40| or capillary electrophoresis (411, the anal- 
ysis of the separated peptides by elcctrospray ionizar 
Uoo (ESI) MS/MS. and the correlation of the generated 
peptide, spectra with sequence databases using the 
SEQUBSX program developed at (he University of Wash- 
ington j32, 31}. The system automaUcally -perform* the 
following operations: a particular peptide ion character* 
lied by its mass-lo-cbarge ratio is selected in the MS out 
of all the peptide , ions .present in. the system at a part i- 
. cuter time; the selected peptide too is collided in a colli-, 
sioo ccti with argon (col lis ion- induced dissociation. 
CID). and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CIO spectrum is then corre- 
lated with the CIO spectra predicted from ail (he pep- 
tides in a sequence database which have essentially the 
samo mass as the peptide* selected for CID; this cortela-. 
Uoo matches the isolated peptide with a sequence seg- 
ment to a database and thus identifies the protein from 
which the peptide was derived, there arc a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
became it is curieoUy the rrtost highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 

3.3 fteleJo identification by LC-MS/MS, capillary . 
LC-MS/MS and tiS-MS/MS. 

It has been demonstrated repcateiJIy thai MS has a very 
high intrinsic sensitivity. For' the routine analysis of gel- 
sepirated proteins at high sensitivity, J he most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the pioblcm is the eatiaclioh and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into (he 
MS/MS system without significant joss of sample or 
introduction of unwanted contaminants. We; employ 
three different systems for introducing gel-purified sam- 
ples into an MS. depending on the level of sensitivity 
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required. As an approximate guideline, for samples con-- 
lataiag tens of picoraoles of peptides,. LC-MS/MS is 
roost appropriate; for sample* staining, tow picomole 
amoimtt to high femtomole amouots wo use capillary 
LC-MS/MS; and for samples containing ftmtorrioles or 
less, CB-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to' an HPtC system using a 
0.5 mm diameter or bigger reverse phaie (RP) column 
has been described In detail (42). This system has several 
advantages If a large nuihbor of samples are to ho ana* 
tyztd and all are available in sufficient quantity. Hie 
LC-MS and database starching program can be run in a 
fully automated mode using an aulosampler, thus maxi- 
mizing sample, throughput and minimizing the heed for 
operator interference. Tho relatively largo column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and rsucro-spmyer 140], analyses can be per* 
formed on a small fraction of the sample (less than 5%) 
while the remaindex.of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used lo analyze 
peptide fractions, such as scintilla Uoo of on introduced 
radioUbet, and this data can be correlated with peptides 
identified by CID spectra. 

3.3 J Capillary LC-MS 

An increase of stnsitivily of approximately tenfold can be 
achieved by using a capillary LC system with a 100 urn ID 
column rather than a Q.$ mm ID cotumn as referred to 
above. Since very low Dow rates arc required for such 
columns, most reports have used a precclumn flow split- 
ting system for producing solvent .gradients.. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables. the formation 
of reproducible gradients at very low flow rates ; (low 
nfVmin) without the need for tlow splitting (A. Ducret 
zt o/. ( submitted for publication). Using Ibis capillary 
LC-MS/MS system we were able to Identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
vvere loaded onto the gel 140), This system is a? yet not 
automated and, like all capillary LC systems, is prone lo 
bloccigc of the columns by xntcroparticulates when ana- 
lyzing gel-separated proteins. 

3J3 CE-MS/MS 

The highest level of sensitivity for analyzing gclrsep- 
aratcd proteins can be achieved by using capillary elec- 
trophoresis — mass spectrometry (CJB-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPB-CE) system which was used with triple 
quadrupole and ion trap ESt-MS/MS systems for the 
identification of proteios at the low femtomole to sub'- 
femtomole sensitivity level (43, 44]. While this system is 
highly sensitive, its operation is tabor-in teosive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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microfcbficattd devices for the Reduction of wrhplcs 
into BSt-MS foi high-setaiUvily peptide w.lysis. 

•me basic device is a piece of gUss.tolb which coaiuw& 
of »L in depth and 5IM0 m in •«* « 
clchedby using photouU.ography/etebins .echnlques 
stai to .o «he one, Used to *« scmieonduc.or.adus.ry. 
A simple device is sho*« ih Fig. 3). I* chaao cb «c 
eConectod .9 .0 external high vo Uge power supply |«£ 
ITpies «« manipulated on the de«ce and off U* 
device lo the MS by applying different potentials to U» 
?«c£bL This create/. solv«i,t How by el.cuoosoo.lc 
purnplnV whW. can be. redirected by changmg the posl- 
S„7ofthc electrode. Therefore, witbou the peed for 
vah.es or gates and without any external pumping the 
flow can be redirected byrimply swilcWng the postUoo 
or the electrode, on the deWceiThe direcuoa »d ^teof 
the flow can be modUtoUd by U>« *«« and the polanty 
of the electric field applied tod also by the charge state 
of the surface. 

The typcof dita generated by .he system is illurlMted in 
Pie 4 which shows the mass jpectnim of a peptide sample 
reprelenting the Iryplie digest of carbonic anhydraie ,t, 
290 fmol/pJL Each numbered peak Indicates a peptide suc- 
cessfully Identified as being derived from carbonic an- 



mitioftbiicitMl M»]yU«l fyiUra for CE». 
tonsiiUnj of * oueroBuAJfttd devke. 
cotftd apilUij JecUoMtBOlic pump. * 
ud miciotlttini.par totcifu*. The 
dimension! of ibe chwotii tod luznoYt 
•te «f Iniflttted in U>» text* Tnt c&wwitl* 
on tbt dttleo vue irephuailr catuncod 
10 ouU Uitm owa TUiW*. lUpiodoced 
rrotn (45|, wili pumWoa. 

hydrase. Some .of the unassigneo. signals may be chemical \ 
or ocpliec contaminants. The MS is programmed to.auto* 
martcallv select each peak and subject the peptide lo ClU. 
SSm cTo rpecira are then used-to identify the 
■ protein by eorrcUAion with sequence debases .Therefore, 
ibis system allows us lo concurrently apply a number Of 
orotein digests onto the device, to sequentially mobilize 
fee samples, to automatically generate CID spcetra of 
selected peptide ions and' to search sequence databases 
forofolein identification, these steps are performed auto- 
matically without the need foruser input and protejns can 
be identltled at very low femtomole level seosttivjiy at a 
rat* of. approximately one protein per 15 ram. . 

3.4 Assessment of 2-DE-MS proteomt. technology 

Using a combination of the paalyUcal techniques de- 
scribed above we haw ideotified/the 80 protein spots * 
iadicated in Fig- 5. Ibe protein pattern was generated by 
separating a total of 40 microgram of protein contained 
m a total cell lysale of the yeast strair> YPH499 by h^h 
resolution 2-CBand silver staining of the separated pro- 
teins To estimate how far this type of protcome analysis 
cao pebetrate towards the identification of low abun- 
daoce proteins, wo have calculated the codoo bias of the 
genes encoding the respective proteins. Code* bias is a 
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calculated, measure of the' degree of redundancy of trip- 
let DNA codons used to produce each amino add lo a 
particular gene sequence. It has been sbowo to be_a 
useful indicator of the level of the protein product of a 
particular geoo sequence present in a cell (4& The gen* 
end-rule which applies Is thai the higher Ibc value of the 
codon bias calculated for a gene, the more abundant the 
piotcio product of that gene becomes. The calculated 
codoo bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly ail of the pro- 
teins identified (> 95%) have codon bias values Ql>y2, 
indicating they axe highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeast genome 
. {Pig. 6a) show that the majority of proteins present in 
the pioteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the'euneot status of pxoteoma analysis technol- 
ogy. It is clear that even using highly. sensitive aoatylical 
tccboiques. wc ere only able (o visualize and identify the 



moro abundant proteins. Since maty important regula- 
tory proteins axe present opjy at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing rruny more protein* than 
tho. approximate!* Wb gene products* present in .yeast 
celis 1161 Io tho analysis of, for example, the proteomc 
of any human' cells; \h*t* are potentially 5DQOO-100000 
gene product* H?|. Inherent' limitations on the amount 
of protein that can be loaded on J-DB, and the number 
of components thai can be tcsotvtd, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyied. One approach 
that bas been employed to circumvent these limitations 
* is the use of very narrow range immobilized pH gradient 
strips for tho first-dimension separation of J-DE [481. 
Since only those proteins which focus within the narrow 
range will eater the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification/of less abundant proteins. 
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4 Utility of proteome analysis tor biological 
research 

For ibe suctess orproleorjdi?s as j.mainjtxcam appioadi 
(o the analysis" of biological systems it is essential to 
define' bow protcome analysis aid biological research 
projects intersect. Without a clear plan for tho Implemen- 
tation of prolcotao-typc approaches into biological re- 
search projects the full impact of the technology can not 
be realized! The literature Indicates tbat pioteome anal: 
ysis U used both as a database/data archWo, and as a bio- 
logical assay or biological research tool. 



4.1 The p*oteome as a database 

liie use of pioteomics as. a database or data archive 
essentially entails an attempt to Identify all the proteins 
in a cell or species and to annotate each protein with the 
known biotogtcftl tpfonnalion thai b .relevant fot each 
proteio. The level of annotation can, of course, be extep- 
jiTc/The most common implementation of this Idea is 
the separation of proiclos:by high resolution 2-DE, the 
identification of each delected protein spot and'tije 
annotatioo of the protein spots in a 2-DE gel database 
format. This appioach is complicated by the fad thai it 
difficult to precisely define a protcome and to decide 
wbich protcome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the protcome is highly dyqiraic- Pioccsses 
such as differentiation, cell activation and-discase can alt 
significantly change tbc protcome of a rpeciw. This b 
illustrated in Fig. 7. The figure shows Iwo bigb-resolu- 



lion I DE maps of proteins isolated ^^"n- 
Ffc M is from the serum of normal rats, wh.lc Fig. 7B 
L from the serum of rats in seute-pbase serum after 
prior treatment .with an iiunammauoo-ausing.agcnt [4ft 
lib obvious that' the protein patterns axe- signlficaaUy 
different in several area*. rai*in$.the-qutstion of exactly 
.which Pfotcomc U being described. 

Thercfofo, a comprehensive protcome database of a spc? 
* cits or cott type needs to contain all of the parameters 
which describe the suie and the .type of the cells from 
vhjcb the proteins were extracted as well a* tho software - 
tools to search the database with queries which reflect 
the dynamics of biological systems.- A comprehensive 
proteome daUbasc should be capable of quaatiUlivete . 
describing the fate of each protein if specific system! 
and pathways arc activated in Ihe cell Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature- of molecules specifically interacting 
with a protein a* well as the rale of change of theso. 
variables should be. . described. Using these admittedly 
stringent criteria, tfiere is currently no enmtete protcome 
database. A number of such databases are,; however, in 
the process of being constructed. The moat, advanced 
among them, in our opinion, are the yeast protein data- 
base YPD (50) (accessible at hup://vnw_ypd.ccm) and 
the human 2D- PAGE, databases of the Danish Centre 
for Human Genome Ktseiich (UJ (accessible at htlpJf 
biobase.dk7cgirbin/ccUs). While neither can be con- 
Sidercd complete as not alt of tho potential gene pro- 
ducts are identified, both contain .extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples." 

"IX The prVlebme as a biological assay 

The use of protcome analysis as a biological assay or 
research.* toot represents an alkroativa approach to inte? 
grating biology with piotcQmies. Tb investigate the state 
ora system, samples are subjected to a specific proceess 
that allow the quantitative or quaiititire measurement . 
of some of the variable* which describe tho system. In 
typical biochemical assays ona variable («.$.. enzyme 
activity) of a single component {eg., a particular en- 
iyme) is measured. Using proltomics as an assay; mul- 
tiple variables (e.g., expression level, rale of synthesis, 
phosphorylation state, etc) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less TaMeaching prop- 
osition lhao the construction of a comprehensive pro- 
leome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems -and pathways, as long as Ibe interpretation of- 
the results takes into accouol that with current tcchnol* 
' ogy nol all or the variables which describe the. system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis.. as a 
biological assay is when a 2-DB protein pattern gener- 
ated from Ihe analysis of an experimenlal sample is 
compared to an. atray of rcfetence patterns representing 
different slates of the system, under investigation. The 
state of the experimental system at the time tho sample 
was generated is therefore, detarnioed by the quantita-. 
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comparative analysis °f hundreos to a few thousand 
^ffSnUrtiw «ily* of the 2-DE paU«M fur- 

SSTto Ueprolein profiles wbid. correlate w.ih the 
SST«f Ih. system. For this type of«»4» «* 
SLSllMl *«• Proteins arc identified or even vau- 
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aliied. although the results become roo.e .rfoiirwnve a, 
Se%.oid» •» compared, ll b obvic.ua. 

Uiic for a particular stale dramatical* eohiriees this 
S«U> by opening up «e* avenues for exptrimcnta- 
lion. • • 
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banned in lh« Held of toxicology. » *arscttnu 
^.^taia or to rtudy difT«eMi»l »eUva .on of tetts. 

" Smiled, of cours*. by to , Ad ttat 
uyf S wolein spot* •« iwhiiM W the ways mo it 
FJSMLT*- ' s-UtantUl b», £ : W comPl- 
fraction of cellular proteins art detected if ■» total ecu 

delved in H>B because they art art *Mri»I 
cnou Mo b M« by * 
St Ihey do not migrate within ^ 
nft resolved by the gel. because they are not soluble 
undcr^e omdiUons used, or for other reason. 

a different way to use proteome analysis as a biological 
tTXtXt^o! a biological syitecn Is lo take 
advantage or die .vert* of Information contained in 
S E To»ein patten*. 2-DB is referred to « ^im^ 
sional because of the ekctrophoretk mobdity and the 
boelectric points whic* define ihe posdioo of each pco- 
Uin irva 2-DB pattern. In addition to the two dunen, 
used to generate the protein pattern • * 
aoditional data dtoruions arc coiitautcd ^ the pfftem 
patterns. Some of these dimensions such a* protcm 
expression level, phos^orylaUon state 
lion association with other proteins, rote of synthesis or 
degr'adatioo indicate the activity slate 
biological system. Comparative analysis of 2-OE protein 
oat terasl Representing different slates is therefore ideally 
S^bl deletion. identificaUon and an^.s o 
Stable marker?. Once again it must be 
in this type of experiment only a fraction of Ihe cellular 
pcbteihsU analyied. Since many regulatory protein* axe 
of low abundance, this limitation, is a eoocern, partieu- 
- Uriy in cases In wbidi regulatory pathways are being 
investigated. 



S Concluding remaris 

In this report We have addressed three main issues 
rotated to proteome analysb. First, we have ftKuotf 
the rationale for studying proteoses. Second, we have 
ceased the technical feasibility of amJyiiog P^m« 
and dejcribed cuneot prolcomc technology, and third, 
we hive analyzed the utiUty of proteome analysis for bio- 
logical' research* II Is apparent that proteome analysU ts 
•n essentia! tool in the analysis of biological aystema. 
The multi-level control of protein synthesis and degrada- 

. tion in cells mean, that only the direct .anelysu .of 
mature protein products can reveal tbclr correct idenU- 
tiei, their relevant state of modification and/or assoc£ 
Uon and their amount Recently developed methods 
have enabled the idenUfitaUoo of P^/ 1 /^ 
increasing sensitivity levels and -at a high level of tuto- 

■ matioo of the awlytical- processes. A number of tech- 
nicer challenges, -however, remain. While it U currently 
possible to identify essentially any protein spoa tbal can 
be vuualiied by common mining -methods, it is ap- 
parent that without prior enrichment only 8 relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which »rc not visuahzed by 
juch methods. Frequently it is the !°w abundance pro- 
lans ihsi execute key regulatory funcUons. 



We ha've outUned the two principal ways proteome anal- 
ysi, is currently being used to intersect w, x h biological 
Uerch p'rojeeuv. the pioteome as a database oc data 
Sc and p^eome* analysis as a biological. assay. Both 
approaches' have to common that at ptesenl they are ton- 
eeptually and technically limited. Current proleomc data, 
bases typically are limited to one ceil type an* ono Jtate 
of » cell and tliereforc-do not account for the dymmlct 
or biological systems. The use of proteome analysis as a 
biological a^say can provide a wealth of mfonnatioiK wr 
it is limited to the proteins detected and U tbercfote root 
truly protcome-wide. These limitaiions in proteorolcs axe 
to a lorge extent a reflection or the fact that proleM to 
their fully processed form cannot easily bo amplified and 
are therefore difficult to Isolate in amounts sumdenUbr 
analysis or experimentation. Trje fact that to dalejoo 
complete ptoleome has been described further attest? to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, wc anticipate that the 
goal br complete proteome analysis will eventually , 
become attainable. 

■Wc vwutdtliki to acknowledge the funding for vuf work 
/rofli ikt HethnoJSdtnu Fottndctho Seiwe and TtOml- 
ogy Center for Molecular Biotechnology end from tht MH. 
Wt thank Yveri Roche* 0*4 Sob Fronia for providing tht 
ytast gel iAoWn and BUabetta Gtanena far providing the 
rot serum gels shown. • . 
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comprehensive set of gene- disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this tool to help interpret the data from a 
large micro T array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedGene is a relational database, stor- 
ing disease and gene information from NCBI, text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web- based user interface for users to 
query the database (http://hipseq.med. harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http:// 
www.nlm.nih.gov/mesh/meshhpme.html) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusLink web site at NCBI (http://www.ncbi.nih.gov/ 
LocusLink/). Official/preferred gene symbol official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
j^^records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chi-square analysis, Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 16 (http:// 
hipseq.med.harvard.edu/MedGene/). In addition, we computed 
the "product of frequency \ which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro- array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro- array fold change in 
expression level. 
' _ Global Analysis. Diseases with at least 50 related genes were 
^^^elected for clustering analysis, and the LPF scores were 
^^^Kormalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using "TreeViewer" (http:// 
rana.Ibl.gov/EisenSoftware.htm). 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinylated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligo-nucleotide micro- arrays. These micro- 
arrays consist of 12 400 probes, which represent approximate^ 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Affymetrix, and then further anaryzed using 
the DNA-Chlp Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 910 Twenty- three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1, or visit http://hipseq. 
med.harvard.edu/MedGene/pubIication/s_Table l.html) were 
selected from the 2002 MeSH index creating a list of 4033 
t human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete, 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI. 11 - 12 which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 th , 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case- insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of False Positives and False Negatives in Unfiltered Data 3 



source of error 



error type 



example 



filter solution 



gene symbol/name false positive 

is not unique 



gene symbol is false positive 

unrelated abbreviation 

gene symbol/name false positive 

has language meaning 

nonstandard syntax false negative 

unofficial gene name/symbol false negative 

nonspecified gene name false negative 



MAG— myelin 

associated glycoprotein 
MAG— malignancy-associated 

protein 

PA— pallid homologue (mouse), 

pallidin (also abbrev. for Pennsylvania) 

WAS— Wiskott-Aldrich Syndrome 
(also the word "was") 

BAG-1 instead of BAG! 

P53 instead of TP53 

estrogen receptor instead of 
Estrogen receptor 1 



eliminate this term 

eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

add family stem term 



* In preliminary studies, Medline was searched for co-occurrence of genes and diseases and the resulting output was evaluated to identify error sources that 
were amenable to global Biters. Each error source is categorized by the type of error it causes: false positives are suggested relationships that are not real and 
false negatives are real relationships that are underrepresented The filter solutions used are indicated Note that in some cases, the filter solution itself introduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity if needed 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with all qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor instead of 
estrogen receptor 1 {ESRf), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes that have an alpha or numerical suffix (e.g„ IL2RA, TGFfi, 
ESR1> etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some pre- selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g., Int. 
J. Health Educ, Bedside Nurse, and /. Health Econ). Together, 
these filters reduced the 12 198 221 Medline publications (July 
2002) by 37%. 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inchisiveness was 
prioritized. To determine the false negative rate in MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed in 
several other databases including the Tumor Gene Database 
(TGD), 2 the Breast Cancer Gene Database(BCG), 1 GeneCards 
(GC) 17 and Swissprot 18 Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer- related genes that were 
supported by at least one literature citation in these hand- 
curated databases, 26 were absent from MedGene, suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion, Supplemental Table 2, or visit http://hipseq.med. 

• harvard.edu/MedGene/publication/s_Table 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest; 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information, Supplemental Table 3, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/ 
s_Table 3.html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 

• argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 13,14 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information, Supplemental Table 4, or visit http://hipse- 
q.med.harvard.edu/MedGene/publication/sJTable 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm CTable 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESR1 and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESRl or ESR2; 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

• To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information, 



Supplemental Figure 1 , or visit http://hipseq.med.harvard.edu/ 
MedGene/publication/sJFigure l.html). For example, in one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate (~9%) and down by the false positive rate (~26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± s.d.) genes associated with it, although the range is 
quite broad with 2359 genes related to breast cancer. 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to anahyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a I-fold difference in mean , 
expression level between breast cancer samples and normal ' 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second- degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
pub!ication/s_Figure 2.html.) Among the 505 previously un- 
related genes, 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to. other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10- fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r= 0.018, p-value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10- fold change or more in 
expression level, the correlation increased to 0.41 (/>-value = 
0.05). 

When we evaluated the micro- array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here. 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the line) and starvation states terms (under the line) clustered together. Within these groups, there is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders, etc.(Supplemental Figure 1: http://hipseq.med. 
harvard.edu/MedGene/publication/s_Figure 1 .html). 

Finally, to validate our findings, we computed similar cor- disease unrelated to breast cancer. As expected, we did not 
relations between the breast cancer expression data and observe an increasing trend in correlation for hyperten- 
LPF scores generated by MedGene for hypertension, a sion. 
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Figure 3. Relationship between literature score and functional data for breast cancer. 3A. The data from an expression analysis of 
samples for breast tumors and normal breast tissue were analyzed to indicate the fold difference of expression level between breast 
tumor and normal sample (cutoff > 3-fold change). The fold changes were plotted against the literature score for the same gene set 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-association. Some well-studied genes, such as BRCA2 (pink circle), are not reflected by a substantial difference in 
expression level. Furthermore, the majority of genes that have no association with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples (y-axis) in relation to the amount of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among estrogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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research articles 



breast neoplasms 


hypertension 


rheumatoid arthritis 


bipolar disorder 


atherosclerosis 


estrogen receptor 


REN 


RA 


ERDA1 


apolipoprotein 


PGR 


DBP 


TNFRSF10A 


SNAP29 


APOE 


ERBB2 


LEP 


CRP 


PFKL 


LDLR 


BRCA1 


AG1 


AS 


DRD2 


ELN 


BRCA2 


INS 




IKri 


ARG1 


EGFR 


kallikrein 


HLA-DRB1 


IMPA2 


APOB 


CYP19 


ACE 


DR1 


HTR3A 


AP0A1 


TFF1 


endothelin 


interleukin 


DRD3 


MSRT 


PSEN2 


S100A6 


TNF 


REM 


LPL 


TP 53 


BDK 


IL6 


KCNN3 


P0N1 


CES3 


DIANPH 






plasminogen 


collagen 


DRD4 


activator inhibitor 


CEACAM5 


SARI 


ILIA 


HTR2C 


PLG 










vascular cell 


hRBsSo 


UTZJ 

rlH 


ACR 


RELN 


adhesion molecule 


cyclln 


CD59 


TNFRSF12 


DBH 


AT0H1 


C0X5A 


ALB 


2L2 


MAOA 


VWF 


cathepsin 


CYP11B2 


CHI3L1 


COMT 


INS 


ERBB4 


MAT2B 


JLB 


HTR2A 


ARG2 




angiotensin 








TRAM 


receptor 


interleukin 1 


SYNJ1 


ABCA1 






matrix 






CCND1 


AGTR2 


metalloproteinase 


INPP1 


OLRI 


EGF 


NPPA 


interferon 


NEDD4L 


collagen 




LVM 


CD68 


FRA13C 


MCP 








transducer of 




insulin-like 


DBH 


IL4 


ERBB2 


lipoprotein 


BCL2 


NPY 


KIT 


BAIAP3 


APOA2 










intercellular 


mucin 


POMC 


MMP3 


ATP1B3 


adhesion molecule 


FGF3 


neuropeptide 


SIL 


DRD5 


RAB27A 



* MedGene results for the top 25 genes associated with breast neoplasms, hypertension, rheumatoid arthritis, bipolar disorder, and atherosclerosis, respectively, 
ranked by LPF scores. The hyperlink to all the papers co-citing the gene and the disease is available at MedGene website (http://hipseq.med. harvard edu/ 
MedGene/). 



Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organization 
and biological systems. High- throughput technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret large gene lists. MedGene, a relational database 
derived by mining the information in Medline, was created to 
address this need. MedGene users can query for a rank-ordered 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedGene is an innovative extension of previous text mining 
approaches. Perez- Iratxeta et al. used the GO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders. 8 MedGene takes a broader view 
and includes all diseases and all possible gene-disease relation- 
ships. Furthermore, MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co- cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 6 

A unique aspect of this tool is the ability to assess the relative 
^strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias 15 and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http://hipseq.med.harvard.edu/MedGene/publication/s_Ta- 
ble 3.html). Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link; 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implementa- 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the filters that we applied were general 
ones, e.g., expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2- fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Changes in ER- but 
Not in ER+ Breast Tumors 



gene symbol 


fold change (ER+) 


fold change (ER- 


KRTHB1 


1.0 


610.8 


BRS3 


1.2 


89.4 


DKK1 


1.2 


69.8 


ZIC1 


1.9 


59.6 


TLRJ 


1.0 


38.5 


JOAA0680 


2.6 


33.2 


CDKN3 


1.0 


30.6 


EBI2 


4.0 


27.9 


GZMB 


3.8 


21.9 


STK18 


4.7 


18.6 


GPR49 


1.0 


14.6 


mow 


1.6 


14.4 


LAD1 


-1.0 


13.5 


P0LE2 


4.2 


13.0 


HMG4 


4.4 


12.9 


BCL2L11 


-1.2 


12.3 


LRP8 


2.9 


12.2 


CCNB2 


1.0 


11.8 


CCNE2 


4.0 


11.6 


FGB 


-4.3 


11.1 


KNSL6 


2.9 


10.9 


H1F5 


3.0 


10.2 


SERPINH2 


4.6 


10.2 


YAP1 


1.0 


10.0 


LPHB 


-1.3 


-10.4 


TCEA2 


-1.1 


-10.8 


TFF1 


1.3 


-11.4 


COL17A1 


-4.1 


-15.7 


POPS 


1.1 


-16.2 


BPAG1 


-4.6 


-22.3 


PDZK1 


-1.1 


-36.8 


VEGFC 


-2.8 


-51.5 


MUC6 


-1.4 


-64.9 


SERP1NA5 


-1.0 


-83.1 


MEIS1 


-1.6 


-85.9 


CA12 


2.4 


-150.3 



Table 3. MedGene identified a set of relatively understudied, yet highly 
expressed genes in ER negative, but not ER positive breast tumors. All of 
these genes have either never been co-cited with breast cancer or have a 
weak association except those marked with an *. 



reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g., 
BRCAl and BRCA2) and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more in expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5- fold expression level changes. Second, any genes 
with 10- fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER- negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 




must be taken when interpreting experiments that may contain 
subpopulations that behave very differently. The MedGene 



approach identified a set of relatively understudied, yet highly 
expressed genes in ER- negative tumors that are worthy of 
further examination (Table 3). 



In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high- throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high- throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 
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Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate Melds, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g., 
breast cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 1999. 1 This problem is 
accentuated with high-throughput technologies such as DNA 
micro- arrays and proteomics, which require the analysis of 
large datasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any mlcroarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between. genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database. 2 However, as these resources are hand- 

•:urated, the labor-intensive review process becomes a rate- 
imiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation. 3-7 Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez- Ira txeta et a I. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked to inherited disorders. 8 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gene-disease relationships, including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 

Journal of Proteome Research 2003, 2, 405-4)2 405 
Published on Web 06/1 3/2003 




Genomic and proteomic analysis of the myeloid differentiation program 
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Although the mature neutrophil Is one of 
the better characterized mammalian cell 
types, the mechanisms of myeloid differ- 
entiation are Incompletely understood at 
the molecular level. A mouse promyelo- 
cyte cell line (MPRO), derived from mu- 
rine bone marrow cells and arrested devel- 
opments Ny by a dominant-negative 
retinolc acid receptor, morphologically 
differentiates to mature neutrophils In the 
presence of 1 0 pM retinolc acid. An exten- 

Introduction ' 



slve catalog was prepared of the gene 
expression changes that occur during 
morphologic maturation. To do this, 3'- 
end differential display, oligonucleotide 
chip array hybridization, and 2-dimen- 
sional protein electrophoresis were used. 
A large number of genes whose mRNA 
levels are modulated during differentia- 
tion of MPRO cells were Identified. The 
results suggest the Involvement of sev- 
eral transcription regulatory factors not 



previously Implicated In this process, but 
they also emphasize the importance of 
events other than the production of new 
transcription factors. Furthermore, gene 
expression patterns were compared at 
the level of mRNA and protein, and the 
correlation between 2 parameters was 
studied. (Blood. 2001;98:513-524) 
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Studies of normal myeloid maturation from many laboratories have 
identified genes that may play critical voles in myeloid differentia- 
tion^"* Current studies suggest that these events are dependent on a 
cascade of molecular changes that involve complex modulation of 
mRNA transcription. Furthermore, studies of acute leukemia have 
suggested that the disease arises from the accumulation of myeloid 
precursors arrested at early stages of differentiation and associated, 
in many cases, with chromosomal rearrangements that alter the 
structure of specific transcription factors. 5 Nevertheless, the molecu- 
lar events underlying the production of mature myeloid cells arc 
not well understood and appear to use interacting pathways and 
networks, the elucidation of which requires an extensive descrip- 
tion of the molecular components available to the myeloid cell. 

An extensive body of information is accumulating with respect 
to gene expression profiles of mammalian cells. However, much of 
the information available in public databases has been accumulated 
by the use of techniques such as single oligonucleotide chips or 
cDNA arrays that measure fewer than 6000 of potentially 30 000 to 
120 000 transcripts. The more limited range of analyses reported by 
the serial analysts of gene expression (SAGE) 6 - 7 technique accu 
raiely estimates changes in levels of the more abundant mRNAs but 
requires extensive redundant analyses to measure changes in the 
patterns of expression of scarce mRNAs. We have used a modified 
polymerase chain reaction (PCR)-based cDNA differential display 
(DD) method in which single restriction fragments derived from 
the 3 ' end of cDNAs me separated on a sequencing gel.*- 9 Bands 
trout the gel can be identified initially by sequencing, but then . 



comparison of patterns from different samples can be made without 
further sequencing. This sensitive and reproducible method detects, 
in principle, most cDNAs regardless of whether they arc repre- 
sented in existing databases. 

Systematic analysis of the function of genes can also be 
performed at the protein level. This approach has the advantage of 
being closest to function, because proteins perform most of the 
reactions necessary for the cell. The most common method of 
proteome analysis is the combination of 2-dimensional gel electro- 
phoresis (2DE) to separate and visualize protein and mass spectrom- 
etry (MS) for protein identification. 10 Several such analyses of 
yeast and of normal or malignant mammalian cells have been 
performed. To date, however, there have been few studies in which 
both mRNA and protein have been compared by applying analyses 
to the same samples. The studies of Anderson 11 and Gygi" showed 
that there is not a good correlation between mRNA and protein 
levels, in yeast or human liver cells. However, other analyses 
disagree with this conclusion (Greenbaum el al, manuscript 
submitted, and Futcher et al 14 ). Furthermore, global correlations 
between changes in mRNA and protein levels have not been 
examined during the execution of any developmental program. 

The MPRO cell line was derived by transduction of a dominant- 
negative reiinoic acid receptor construct into normal mouse bone 
marrow cells. It is a granulocyte-macrophage colony-stimulating 
factor (GM-CSF)-dependent line arrested at a promyelocytic stage 
of development. 1 5 >' 6 After treatment with aiUrans retinoic acid 
(ATRA) most of the ceils acquire the morphology of mature 
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neutrophils and begin lo produce neutrophil lactoferrin and gelati- 
nase, 2 proteins characteristic of neutrophil secondary granules.' 7 
As such, it offers a valuable model for studying neutrophil 
differentiation in vitro. 

We now report the analysis of mRNA expression changes 
during the process of MPRO cell maturation to neutrophils and 
compare the results with a limited analysis of cellular protein 
composition. mRNA expression changes were studied by combin- 
ing the use of oligonucleotide arrays and DD. A database (dbMC) 
with comprehensive genomic information for myeloid differentia- 
tion program was constructed (accessible at http://www.biomfo.mbb. 
yale.edu/expression/neutrophil). We have grouped the changes in 
mRNA levels of a large number of genes into 6 patterns, with 
implications for the genetic program of myeloid differentiation. 

We also compared 2-dimensional high-resolution gel eiectro- 
phoretograms from control cells and cells differentiated for 72 
hoiirs in the presence of ATRA. Fifty protein spots whose relative 
intensity changed prominently during differentiation were exam- 
ined by mass spectrometry. The results suggest a poor correlation 
between mRNA expression and protein abundance, indicating 
that it may be difficult to extrapolate directly from individual 
mRNA changes to corresponding ones in protein levels (as 
estimated from 2DE). 



Materials and methods 

Cell lines 

MPRO cells aad HM-5 cells provided by Dr Schickwann Tsai (Fred 
Hutchinson Cancer Research Center, Seattle, WA) 15 were used throughout 
the study. The cells proliferated continuously as a GM-CSF-depcnden! ceH 
line at 37 a C in Tscoves modified Dulbccco medium (Gibco BRL, Grand 
Tsland, NY) supplemented with 5% to 11*% fetal calf serum ^Gibeo BUI,) 
and 10% HM-5 -conditioned medium as a source of GM-CSV. Morphologic 
differentiation of the blocked MPKO promyelocytes was induced by 
treatment with 10 p,M ATRA (Sigma, St Louis, MO). Controls were 
cultured in the absence of ATRA but with the same volume of ve- 
hicle lelhanol). 

RNA Isolation and differential display 

After exposure to 10 p.M ATRA for 0.. 24, 43. or 72 hours, total cellular 
RNA was isolated from MPRO cells using TRIzol reagent (Life Technolo- 
gies, Gaithersburg, MD). cDNA was then synthesized using a T-? Sal-Oligo 
d(T) 32 primer as described previously. The double-stranded cDNA was 
digested with 1 of 9 different restriction enzymes (Apa\ t Bglll, Bam\\\ % 
Eoglt EcoK\ y A/indlll, Xbal, Kp»\, and SphY) and ligatcd to Y-shaped 
adaptors with a complementary overhang. DNA fragments were then 
amplified by PCR as described previously.* 1 * PCR products were separated 
on a sequencing gel of 6% polyacrylainide with 7 M urea. The gel was dried 
and exposed to x-ray film. Genes from differentia] display gels, whose 
maximum intensity changes equaled 2+ on a scale of I + to B + , were 
recorded as significantly changed.' 9 Individual DNA bands were recovered 
from the gels, amplified by PGR, and sequenced. 

Oligonucleotide chip analysis of RNA samples 

Ten micrograms total RNA from each sample (0, 24, <8, or 72 hours) was 
used to prepare cDNA. This cDNA was transcribed with T7 KNA 
polymerase lo prepare a fluorcsccnlly labeled probed 21 Each sample was 
hybridized lo mouse array chip (MulIK Array; Asymetrix, Santa Clara, 
CA) containing oligonucleotide probe sets corresponding lo approximately 
7000 known genes or ESTs represented by UniGene clusters. 22 cDNAs 
were considered present if their probe set results were rated as such by the 
GencChtp software (Atrynietrix) and if the average difference (AO) 
between perfect match and mismatch probe pairs was not less 100 U. If a 



gene was represented by more than one array probe set, the average of all 
probe sets for the gene was taken. Genes with AD values between 100 and 
200 were considered unchanged because of their low expression levels. 
Those genes with AD value* equal, to or more than 200 U al one time point 
were further studied by rcscaJing, threshold, and normalization methods 
described in the MIT Center for Genome Research Web site. ! * A value of 20 
was assigned to any gene with an AD below 20 at seme time point. 

Biolnformatics and database development 

AH the sequences cr gene fragments were searched using Blast against 
GenBank and TIGR gene indices. A database of genes or ESTs whose 
expression levels changed during myeloid differentiation was constructed 
containing information for each band or gene. This included GenBank 
matches, Locus UnV or Unigene clusters, expression patterns^ tissue 
distribution, synonym(s) protein name, gene name(s) { notations of possible 
functions, poly A signal and sequence quality, and hyperlinks to the 
database searches, sequence trace files, and related references. Ail gene data 
were dicn gathered into a cluster file. Supplementary information is 
available at http7/bioinfo.mbb.yale.edu/expression/neutrophil. 

Classification and analysis of DNA fragments 

Sequences from differential display analyses were classified as representing 
known genes, ESTfc, genomic sequences, or novel genes as described. 19 ' 23 
Known genes from both differential display and arrays were clustered into 
27 functional categories and searched against SWISS-PROT (http:// 
www\cxpasy.cor.nrc.cVcgi-bi^i/sr^t-semch-fuJ) or P1R (http://www.pir. 
gcorgctown.edu/). Information such as function, subcellular location, 
family and super family classification, map position, similarity, synonym(s) 
protein name, gene name(s) t and so on was recorded in a variety 
of databases. 

Northern blot analysis 

Thirty micrograms total cellular RNA per lane from time-course MPRO 
cells wer e loaded onto 1.2% formaldehyde-agarose gels, then transferred to 
Hybond-N+ membranes (Amcrsham Pharmacia Biotech, Uppsala. Swe- 
den). Atler standard prchybridiEation, membraaes were hybridized over- 
night at 65°C with radiolabeled cDNA probes (ordered from Research 
Genetics according to their dbEST Image ID). Membranes were washed at a 
final suingency of 60°C in 0. 1 X SSC. 

Immobilized pH gradient 2-d intension a! gel electrophoresis 
and mass spectrometry 

Induced MPRO cells collected al 0 and 72 hours were lyscd with lysis 
buffer (540 mgurca, 20 mgdithiolhrcitol, 20 u,LPharmalylc [3- 10], 1.4 nig 
plumy Imethylsulforiyl fluoride, I p.g each aprotinin, teupepnn, pepstatm A, 
and anripain 50 p,g TLCK, and 100 p,g TPCK/i ml). We applied 100 pi 
each MPRO cell h/sate (2.5 x lO^ells/iOO u.L) to immobilized pH 
gradient (ITG) snips (pH 3-10 L; Amershara Pharmacia Biotech), and IPG 
electrophoresis was conducted for 16 hours (20 100 Vh) using an immobi- 
line Drystrip Kit (Amcrsham Pharmacia Biotech). Electrophoresis in the 
second dimension was carried out in b 12% sodium dodccyl sulfate- 
potyacrylamide gel electrophoresis (SDS-PAGE) gel with the Laemmli- 
SDS continuous system in a Protean II xi 2-D cell (Bio- Rad) run nt 40 mA 
constant current for 4.5 hours. Proteins were detected by Brilliant Blue 
G-coiloidal staining. 3 * Protein spots were excised from the gel and digested 
with trypsin. ACTH clip (average [M+It] 2466.70) and bradykinin 
(average [M+H] 1 061 .23) were used for calibration of peptide masses. One 
microliter sample digest was mixed with 1.0 ^.L a-cyano-4-hydioxy 
cinnamic acid (4.5 mg/mUn 50% CFT 3 CN. 0.05% TFA) matrix solution and 
1 uL calibrants (100 fmol) each. The spectra of die peptides were acquired 
in reTiector/dcIaycd extraction mode on a Voyagcr-DE STR mass spectioro- 
eter (Perseptive Biosystems, Foster City, CA). Peptides were identified 
using the ProFound search engine. 19 
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Results 

Differentiation of MPRO cells 

Figure t illustrates die morphologic changes in an MPRO cell 
population representative of those used for RNA expression 
analysis. Undift'emitiateu MPRO cells resembled promyelocytes 
under the light microscope (Figure 1 A), After induction with ATRA 
for 24 hours, the cells morphologically differentiated into metamy- 
elocytes (Figure IB). At 48 hours, die cells further developed 
into metamyelocytes and band neutrophils (Figure IC). At 72 
hours, nearly 100% of MPRO cells became mature neutrophils 
(Figure ID). 

Identification of mRNAs by differential display assay 

MPRO cellular mRNAwas analyzed at 0, 24, 48, and 72 hours after 
ATRA treatment Nine restriction enzymes were used in a 3 '-end 
DD approach. During MPRO differentiation, 1109 fi-agments 
corresponding to B37 transcripts were found to change substan- 
tially in expression levels (Figure 2). These represented approxi- 
mately 279 known genes, 112 ESTs, and 59 putative new genes, 
each with a perfect or fair polyadenylation signal at an appropriate 
distance from the oligo-dT priming site. The gene information 
detected by DD was collected in database dbMCd. 

Identification of mRNAs by oligonucleotide chip assay 

We used an oligonucleotide chip containing 13 179 probe sets 
corresponding to approximately 7000 murine genes to analyze 
patterns of mRNA expression in the same RNA samples used for 
DD. The information obtained by oligonucleotide arrays was 
collected in the database dbMCa. 

We clustered the genes by their similarity to idealized 
expression patterns. For instance, the expression pattern of an 
ideal gene that is ovcrexpressed (high) at time 0 and underex- 
pressed (low) at 24, 48, and 72 hours, would be high-low-low- 
low (HLLL). Overall we have (2 4 -2) idealized patterns exclud- 
ing HHMH and LLLL. Pearson correlation was used as the 




Figure 1. Morphology of MPRO cells during differentiation. Mf»RO ceBs ware 
induced as described in 'Materials and methods,* concentrated by cytospin. ar.d 
Wrisbt-Giemsa stained. (A) Uninduced MPRO cells. (B) MPRO celts induced with 
ATRAicr 24 hours. <C> MPRO cells induced with ATRA for 4 8 hours. (O) MPRO cells 
Induced with ATRA for 72 hours. 



Genomb 1 % Mitochondrial 
Sequences ° /Sequences 




Figure 2. Distribution of genes obtained by DD assay. MPRO cell mRNA was 
analyzed at 0, 24, 48, and 72 hours after ATRA treatment; 1109 fragments 
corresponding to e37 transcripts were found to change substantially in expression 
levels. The toiot 837 transcripts were classified into 6 categories according to the 
btolnformatic analysis. Percentages show the gene distributions in these 6 catego- 
ries. Information for each transcript was collected In database dbMCd. 

measure of siruilaricy of each gene expression pattern, 
x - (XiyX2jc i3 Xi) to each of the 14 idealized patterns 
y - (yi^.Vjj^). The 4 entries of x and y corresponded to the 
4- dimensional gene expression levels at 0, 24, 48, and 72 hours, 
respectively. Each gene was assigned to a chaster labeled by the 
idealized pattern that had the maximal correlation with thai 
gene. We selected only genes that hybridized well compared 
with the background (considered "present" by GcncChip soft- 
ware) and had maximal AD amplitude greater than 200 U in at 
least 1 of the 4 stages. Wc further tabulated the 14 patterns 
according to whether the gene expression changed at early 
(0-hour), intermediate (24- and 48-houv). and late (72-hour) 
time points and whether gene expression monotonically in- 
creased (up-regulated), mouotomcaUy decreased (down-regu- 
lated), or was not monotonic (transient). Table 1 shows 8 
clusters of 104 genes that had significant changes of mRNA 
levels, arranged according to the temporal stage and the 
monotonic/rransient changes of expression levels. 

Principal component analysis determined whether wc could 
comprehensively present multidimensional data (4-dimcnsional in 
our case) in a simple 2-dhnensional graph. First, we found the 4 
principal components, which were the axes of the most compact 
4-dimensional ellipsoid that encompassed the 4-dimensiona! cloud 
of data. Each axis was a different linear combination of the original 
4 variables. Then we verified that the first 2 principal components 
(the first 2 largest axes of the ellipsoid) captured most (95.2%) of 
the variation of the data. Therefore, the data could be faithfully 
projected (with a minor loss of information) into a 2-dimensional 
graph, with the 2 largest principal components as the x- and y-axes. 
As shown in Figure 3, genes tend to coalesce in clusters, according 
to their labels determined by their similarity to an ideal expression 
pattern. In summary, a genomic (global) picture of the distribution 
of genes according lo their similarity to predetermined idealized 
multidimensional expression .patterns is concisely displayed in a 
2-dimerisional graph. 
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Tabta 1. Genas differently regulated during the different stages of mouse promyelocyte cell line differentiation process 

Timing 



Category Early Middle Late 



Up-regulation 


LHHH (n = 10) 


LLHH (n - 6) 


ULH(n=13) 




Ated P2rx1 Itgb2 U1r2 Lcn2 Hpt5 


Pk&t Cybb Pfc PiraS Cd53 Hngr2 


tt1a CsdrW Crs/SIOOaB L-CCR ass 




Cebpb H2-DEtohi6Zyx 




Aldol Rac2 Fpii Cfsd Ubb Ptmb4 


Down-regulation 


HLLL (n = 11) 


HHLL(n = 1) 


HHHL(n=37) 




Tcrg-V4 LyG4 Ctsg Spl2-1 McptS 


Mpo 


Actx \tt2 EL2 Rpt1 9 Actb LySe Atf1 Hlst2 




Myc Myb Tlr4 Npml Erh Hsp60 


\ 


Psma2 Gnac Zfp36 ti4n LfbrShfdgl 
Max RpsB Csf2rbt Slpi Tctexl Tpl 3tf3 
CnrfGys3 SlclOal Ctsb Seppl Rin3 
Ccnh2 Sl00a9 C!t1 Hls*5-2ax Rels 
Copa Gstml Gnb2-n>1 Grn RPL8 


Transient 




LLHL (n * 9) 

SeH Kta PiraB Pirb Lst1 Ltf Senia4d State MnipO 
LHHL(n => 17) 

Ccbpa LyzB Fcgr3 Arf5 Lompt Sxat3 Csf2ra Osi 
Actg Sfpil Gpx3 Ptprc PrtnS trfl Rps6ka1 
LiWrMyln 





Arrays of Affymetrix Mullk containing 13103 probe sets corresponding to 12002 GenBank accessions were used for hybridization. Arrays were hybridized wfth 
strcplavidiivphycoerythftn (Molecular Frobes) biotirvtabeled RNAand scanned. Intensity for each feature of the array was captured using SciwcNp software (Affymelrnc), end 
s single raw expression level for each gene was derived from Ihe 20 probe pairs representing each gene using a trimmed mean algorithm. For each gene, an .AD of 24-, 48-, and 
72-hour samples was calibrated by dividing the slope of Ihe linear regression line for a graph wllh the x-axis the AD of 0-hour probe sets and Ihe y-axfe the AD of Ihe respective 
time point (24, 46, or 72 hours). A threshold of 2D U was assigned to arty gene with a calculated expression level below 20 because discrimination of expression below this level 
could not be performed with confidence. 36 Each gene expression profile was categorized as described in Tables 3. 4, and 5. For the 4 time points, the minimum AD of the 
relatively higher group (MIN-H) was divided by the maximum AD of the relatively low group (MAX-L). and those genes whose MIN-H/MAX-L greater than 2 were selected as 
meaningfully regulated. Genes were sorted tn descending order based on the MIN-H/MAX-L Genes In boldface are those whose expression level was tn the top 20% (ie, 
maximum AD of 4 time points greater than 3000), and genes in italics are those in the bottom 20% (ia, maximum AD of 4 lime points less than 300), The differentiation period 
was grouped into 3 stages: early (0-hour), middle (24-hour and 46-hour), and late (72-hour) stages. 

AD indicates average difference; gene symbols are expanded In an Appendix at the end of this article. 




Figure 3. Gene clusters In the first 2 principal component spaces. Principal 
component analysis allowed us to present the multidimensional data (In this case. 
4-dirRensior.ar date of each gene expression pattern) In o simple 2-dimensional 
graph. We derived the 4 principal components, which are a linear combination of the 
standardized expression intensities (zero mean and unit variance) at 0, 24, 48. and 
72 hours. The first 2 principal components captured most of the variation of the data 
(apprcotiniatefy 85%). Therefore, the data can be displayed (with a minor toss of 
information) in a 2*firnensicnal graph. The first and second principal compcrtents, cl and 
c2,aie given by the linear co:nWrurJons t>, = 0.747 * nl - 0.11 - n2 - 0.656 - n3 + 0 • 
n4 end cj = 0.278 • n1 + 0.353 • n2 +■ 0.233 - n.3 - 0.863 - n4. whe;e nl, n2, n3, 
end n4 are the reseated and standardized expression levels at 0, 24, 48, and 72 
hours. iBspectively. The exes leu ends cl and c2 stand for the first 2 principal 
components. In this paper we used the Pearson correlation to measure the similarity 
of each gene with the idealized expression patterns, as opposed to the EttcKdep.n 
distance we used in a previous work. 1 * because clusters were better separated using 
Oils measure. In both cases, we presented the data in the 2-d'mensiona' space of the 
lowest principal components. The data had a tendency to be drculariy distributed 
when we used the Pearson conelatton as a distance measure. 



Correlation between array and DD analyses 

We have previously demonstrated a correlation coefficient of 0.93 
between visual estimates of changes in band intensity on DD and 
Phosphorimager System (Molecular Dynamics, Sunnyvale, CA) 
estimates of band intensity and a correlation coefficient of 0.88 
between hybridization intensity changes of mRNA on Northern 
blot analyses and changes in band intensity on DD. 19 In a few cases 
there were clear discrepancies in the pattern of expression of a 
gene, as estimated by DD and by oligonucleotide chip analysis. We 
chose the 6 most extreme cases and examined the levels of mRNA 
change for these genes by Northern blot analysis (Figure 4). In 5 
cases, the Northern blot results agreed with the results of the DD 
analysis, whereas the results of Gnb2-rsl disagreed with the 
oligonucleotide array but duplicate bands from DD showed a 
relatively high level of expression in the 0 time sample that did not 
correlate with the Northern blot (Table 2). One possible explana- 
tion for these findings was the change in the relative use of different 
polyadenylation sites after the addition of ATRA to the MPRO cells. 

Constructing a database for mRNA level changes during 
myeloid differentiation 

Based on the data obtained above, an in-house database (dbMC) 
was constructed that included 2 subdatabases, dbMCd and dbMCa, 
for collecting gene information from DD or oligonucleotide arrays, 
respectively. Each entry in dbMC is accompanied by a so-called 
executive summary. The linkage between dbMCd and dbMCa was 
established by UniGene ID and cluster ID. dbMC contains the 
temporal expression patterns of genes during the MPRO cell 
differentiation process, including not only products represented in 
public databases but also novel transcripts. 
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Figure 4. Northern blot analysis of selected mRNAs. Equivalent amounts of RNA 
from MPRO cells Induced byATRA at different time point* (0 hour, ?A hours, 48 hou«T£, 
and 72 hours) were resolved by formaldehyde-agarose gel electrophoresis, stained 
io verify the amount of leading. Eleven genes were separately probed on the RNA 
finers. The gene symbol of each probe was listed at the left of a related Northern blot 
resuit. Detailed Information on these 11 probes was listed In Table 5. One of the 
RNA-btctted membrane photographs Is shown with methylene blue -stained 23S and 
1BS RNA subunits demonstrating the quality and quantity of RNA loaded in 
individual ianes. 



Analysis of gene expression patterns during MPRO 
differentiation 

Many of the genes identified in this study were found in myeloid 
cells or were implicated in myeloid development for the first time. 
We detected 8 cytokines 25 and chemokines whose mRNA levels 
changed more than 5-fold by arrays and 2-fold by DD during the 
maturation of MPRO cells (see our Web site, http://T5ioinfo.mbb. 



yale.edti/cxpression/neutrophil). Among these were 2 members of 
the CC chemokine family. Interleukin-1 a (IL-la) was up-regulated 
at the late stage of differentiation (LLLH pattern. Table 1). 

mRNA for approximately 52 receptors was delected by one or 
the other method. A number of the receptors known to be present on 
mature neutrophils showed late induction of mRNA, and their 
levels of induction were high, indicating that the expression of 
these products is a prominent event late in neutrophil maturation 
(Table 3). Rarely was mRNA for receptors down-regulated, 
consistent with myeloid maturation being accompanied by increas- 
ing responsiveness of the cell to a variety of external stimuli. 

Expression of mRNA for granule proteins 

Neutrophils contain several types of granules that develop at 
different stages of myeloid rnaturation. 3 ' 17 ! 26 Levels of mRNAs 
encoding secondary granule proteins, such as lactoferrin, increased 
as the cells matured (Table 4). The level of mRNA for Mmp9, 
reported as a tertiary granule protein, increased markedly between 
24 and 48 hours after the induction of differentiation, whereas 
mRNAs for secondary granule proteins either increased less 
markedly or showed a maximum increase by 24 hours. mRNAs for 
several primary granule constituents, such as myeloperoxidase and 
csthepsin G, were present in unstimulated cells and decreased as 
the cells matured There was a discrepancy in the measurements of 
proteoglycan mRNA by DD and oligonucleotide chips, but North- 
ern blots showed that it reached a peak at 48 hours and then 
declined (Figure 4). Cathepsin D is reported as a primary granule 
protein, but its pattern of mRNA expression more closely re- 
sembled that of secondary granule constituents, fn addition to 
known granule components, mRNAs for several other cathepsins 
were up-regulated during myeloid differentiation, in paraJlel with 
or later than the tertiary granule protein mRNAs. 

mRNAs for transcription factors 

Transcription factor genes, including several identified at the sites 
of consistent chromosome rearrangements in acute myeloid leuke- 
mia, have been implicated in normal myeloid differentiation and in 
the expression of neutrophil proteins. 2 * 5 * 27 However comprehensive 
information concerning the expression of these transcription fac- 
tors during myeloid development is not readily available. There- 
fore, we compared gene names and identifiers in our databases io 
those of the transcription factor database Transfac (http:// 



Table 2. Expression patterns of genes detected by Northern blot analysis 



Gene 
symbol 



Gene 
secession 



AD value by array 



24 h 



48 h 



72 h 



Intensity by DD 



Cebpa 


M62362 


33 


212 


162 


4fi 










Cebpb 


X6260O 


3D0 


1248 


1380 


1903 










Cehpd 


X61B00 


157 


252 


168 


430 










Cebpe 




















Myb 


M1Z648 


892 


356 


230 


435 










Sip} 


U73004 


617 


5G1 


763 


402 


1 


2 


3 


3 


Prg3 


W45H34 


153 


259 


339 


345 


5 


1 


1 


2 


Gnb2-rs1 


X75313 


4231 


3623 


3215 


3403 


4 


4 


1 


1 


Ly6e 


U0426B 


3061 


53S1 


2844 


12H2 


3 


2 


1 


i 


L &P 1 


M9C316 


65 


376 


840 


28 


2 


3 


5 


6 


Actb 


X0376S 


3095 


358H 


3976 


2<34 


1 


2 


3 


2 



the V5 



ene symbol end gene accession refer tc National Center for fliotechnoiogy Information databases end, In particular, to Locus Lin*. AD vslw Is the average difference In 
We of hybridization intensity between th« set of perfect* matched oSnnnutfectides and the set of mismatched oligonucleotide in the oligonucleotide array 3?nd 
tntensrties from DD were semlquanirfied on a scale from 1 (+)to 8 (+ + + + + + ++). These estimates are shown as boMace numbers in this table » Both AD value end 
intensity of genes were studied st 4 time points corresponding to MPRO ceils induced for the indicated times. 

DD indicates differential dts-play; MPRO, mouse promyelocyte cell line; for gene symbols, see the Appendix at the end of this article. 
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Table 3. Receptors expressed during myeloid differentiation process 



Maximal fold change 



Gene symbol 



Gene accession 



Oh 



AD value by array 



24 h 



43 h 



2 or more, less than 3 



3 or more, less than 4 



4 or more, less than 5 

5 or more 



Bzrp 

Cmkart 

Crry 

Csf2rh1 

HtrSa 

MGpr 

MPPIR 

TCRGB 

Tnfrsfla 

Cmkbrl 

Crhr 

Csf2ra 

Ebl3 

Gridl 

Imgr 

Il2rg 

Ld:r 

P40-8 

Plaur 

Rang 

Srbi 

Cr2 

Csf2rb2 
Fcartg 
Fcgr2b 
(fngrS 

Nr4e1 

I1U2 

C5r1 

Drd2 

Fcgr3 

Pprl 

GCR 

L-CCR 

NMDARGB 

F'2ncl 

Piral 

PiraS 

PiraG 

FMrb 

Sell 

Tcrg-V4 



D21207 

K99501 

M34173 

M 34397 

216273 

X6406Q 

AA1 16789 

M2B053 

M59377 

U2B404 

X72305 

W8507B 

AF015114 

D10171 

J05265 

U21795 

X544U 

J02870 

X62701 

M34476 

U37799 y 

M29281 
M2BB55 
J05020 
X0464B 
U6953FJ 

X16995 

X5S769 

L05630 

K55B74 

M14215 

L22161 

AA24D711 

AA034R46 

AAB2021 1 

XB4695 

U96G32 

U96666 

U9G687 

U36689 

M25324 

M54993 



641 

508 
433 
318 
166 
536 
232 
165 



221 
121 
171 
187 
128 
141 
205 
1399 
649 
312 
102 
126 

63 
209 
2398 
1703 
1 



482 
185 
0 
1 
0 
2 
48 
2 
79 
0 
274 
122 
191 
46 
1650 



658 
447 
384 
345 
272 
409 
B4 
212 
1 

244 
200 
372 
270 
164 
263 
184 
1653 
677 
443 
113 
232 

138 
249 
2766 
1652 
2 



1796 
434 
0 
1 
59 
0 
175 
2 

346 
43 
391 
635 
445 
104 
7B 



881 
378 
506 
410 
273 
408 
63 
244 
1 

504 
250 
402 
428 
150 
327 
231 
1665 
361 
476 
114 
132 

243 
437 
3365 
1431 
2 

202 

2672 
808 
0 
1 

141 
0 

314 
0 

530 
172 
954 
2014 
966 
570 
65 



687 
664 
506 
241 
339 
649 
381 
299 
1 

636 
355 
254 
148 
257 
251 
477 
3968 
640 
734 
218 
258 

77 
111 
6751 
4605 
3 



3816 
1078 
2t9 
2 

671 
0 

2056 
0 
744 
378 
1B74 
1716 
747 
20 
315 



Receptors ere identified as present whose maximal AD vatuss were more than or equal to 200 U in this study. Genee were sorted by their expression patterns as foMows: 
first by the average difference value, then by the difference between minimum and maximum AD for the4 time points, and last by the alphabetical order of gene symbols. Genes 
were ordered according to the mavimal fold change of AD values. Abbreviations of gene names are taken from gene symbols listed in Ihe Locus link portion of the Natlona! 
Center for Biotechnology Information database whe*fc available. Numbers in hold denote those gene expression patterns obtained by differential display twiner than hy 
oligonucleotide errey sssays. The other information is presented as in the legend to Table 2. 

AD Indicates average difference; gene symbols are expanded In an Appendix at the end of this article. 



www.transfac.gbt-brawnschweig.de/TRANSFAC) and determined 
which factors contained in this database were present at detectable 
leveJs in MPRO cell mRNA, using Affyinetrix software for the 
criteria for inclusion of mRNAs from approximately 200 murine 
transcription factors probe sets on the oligonucleotide chip. Of 
these, 5-1 were expressed and 1 3 showed changes of 3 -fold or more 
in chip signal (Table 5). 

The changes in certain transcription factors, such as the moderate 
down-regulation of myb and myc and the up-regulation of the Max 
dimerizalion protein MAD, were consistent with the shift of the cells 



Gum a proliferative to a differentiated state. 2fi Some changes are more 
difficult to explain, such as the up-regulation of DPI , a partner for E2f 
factors in the regulation of S-phase genes, and the mild up-regulation of 
the Id genes, comirtouly associated with'an inhibition of differentiation 
by competition with bHLH transcriptional activators. 29 

The C/EBP family has been extensively studied with respect to 
myeloid differentiation. 2 - 30 Absolute levels of the C/EBP a and o 
mRNAs were low, probably at the borderline of significance for (he 
oligonucleotide chip assay, whereas the level of C/EBP p appeared 
higher. In addition, there were discrepancies between the chip 
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Table 4. Granule constituents expressed during mouse promyelocyte cell line cell differentiation 



Granule constituent 


Gene symbol 




Gene accession 


Oh 


AD volue by array 
24 h t 48 h 


f 2. n 


Azurophil (primary) granules 


















Man2c1 




AA1B18B0 


178 


134 


99 


164 




Cfsb 




M65270 


442 


480 


59S 


389 




Ctsd 




X528B6 


214 


1087 


1628 


2784 




Ctsg 




M96801 


1509 


405 ■ 


46 


285 




E12 




U04962 


658 


1273 


843 


157 




E1a? 




AA583D1G 


47 


159 


134 


163 




Gus-s 




W63B26 


544 


226 


266 


254 




Lyzs 




M21050 


0 


n 


1 


3 




McptB 




X7B545 


831 


268 


66 


. 491 




Mpo 




X15378 


3788 


3009 


776 


692 




Pra 




X1B133 


2621 . 


2653 


2920 


9859 


Possible granule proteins 


















Ctsc 




AA144887 


252 


194 


342 


578 




Ctse 




X97399 


1 


3 


4 


5 




Ctsh 




U06119 


45 


124 


195 


156 




Ctsl 




X06086 


16 


11 


31 


237 




Ctss 




AA089333 


12 


9 


68 


463 


Specific secondary granules 


















Cpa3 




J0511B 


621 


270 


90 


601 




Cd36l2 




AB008553 


113 


93 


157 


187 




Cnlp 




X94353 


80 


479 


704 


626 




Cybb 




U43384 


8 


24 


91 


128 




Eor2 






0 


1 


1 


2 




Fpr1 




L22181 


178 


220 


235 


845 




Hgb2 




X14951 


0 


2 


4 


2 




Lcn2 




W13166 


916 


3513 


3931 


6036 




Ltf 




J03298 


19 


162 


333 


138 




M3P 




W45B34 


5 


1 


1 


2 




Mmp13 




X66473 


44 


43 


72 


178 




Ngp 




1.37297 


2661 


4782 


2311 


6912 


Tertiary granules 


















Mmp9 




227231 


0 


1 


2 


2 



Shown ansthe possible granule protein cDNAs represented on the oligonucleotide atTays, fiorted by their expression patterns as follows; first by the average difference AD 
value, then by the granule types, and last by the alphabetical order of gene symbols. Data are presented as described in the legend to Table 3. 
AD indicates average difference; gene symbols are expanded in an Appendix at the end of this article. 



estimates and the mRNA levels observed by Northern blotting with 
specific probes for these genes. In particular, the latter method, 
more sensitive and specific, showed that C/EBP a began to decline in 
the most mature cells, whereas C/EBP 8 mRNA declined progressively 
beginning at 2-1 hours after the onset of differentiation. 

C/EBP c is a more recently cloned C/EBP family member Previous 
studies indicated it is expressed in a large array of human leukemia cell 
lines blocked at various stages of differentiation and that it is up- 
regulated during granulocytic differentiation. 31 A C/EBP e probe was 
not included in the oligonucleotide chips, and this inRNA was not 
detected by DD. Therefore, we examined the C/EBP e expression 
patterns by quantitative PCR and Northern blot analysis (Figure 4). 
CEBPc exon I was PCR amplified from MPRO RNAs using primers 
RY48 (AGOCCCCGACACCCITGATa\) and RY49 (TGGCACACT- 
GCGGGCAGACAG) 32 The results showed that C/EBP e is expressed 
throughout myeloid differentiation, with expression levels increased 
moderately in the later stages. 

We detected a number of other transcription factors that are 
broadly expressed or that have been reported in otheT studies of 
hematopoiesis (Table 5). Some of the factors that were most 
strongly induced during differentiation have been studied in other 
contexts but.not previously implicated in hematopoiesis, such as a 
mammalian homologue to the Drosophila enhancer of split gene, a 
transcriptional silencer. The mammalian gene is expressed at 
relatively high levels as measured by the oligonucleotide chip and 



is a candidate for mediation of the silencing of growth-related 
genes in the manuring neutrophil. Another candidate transcriptional 
silencer, Tiflb, may serve as a corepressor for the KRAB domain 
family of zinc finger transcription factors and also may mediate 
binding of the heterochromatin protein HP1 to DNA. 33 

There were 26 transcription factors whose mRNAs showed no 
significant changes by oligonucleotide chip analysis and were not 
identified as differentially regulated genes by differential display 
assays. PU.l, a factor necessary for the production of neutrophils 
and the expression of several neutrophil genes,- 14 showed less than a 
3 -fold increase in mRNA, below the threshold for a significant 
change. Other candidate hematopoietic transcription factors, such 
as PEBPUB2 (AMU), GATA-1, and SP-2, were represented on 
the oligonucleotide chips, but their mRNA levels were so low that 
they were reported es absent in this study. The possibility that small 
changes in the levels or ratios of some transcription factors could 
produce marked changes in transcription potentially limits the 
ability of data generated by present methods to explain transcrip- 
tional changes during differentiation. 

Protein expression patterns of (VI PRO cells during 
ATRA induction 

We visually compared the 2DE patterns from MPRO celis at the 
same time points used for mRNA analysis. In most cases the 
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Table 5. Transcription modulators presented during myeloid differentiation 



Maximal fold change 



AD value by array 



Less than 2-fotri 



2 or more, less than 3 



3 or more, less than 4 



A or more, less than 5 



ene symbol 


Gene accession 


Oh 


24h 


4ft h 


72 h 


Zfp11'6 




2630 


29S9 


2795 


2515 


Rif3 


W13502 


3 




2 


1 


Gata2 




562 


770 


472 


730 


Hmgl 


J04179 


337 


■ 348 


177 


232 


Idb1 




455 


787 


721 


637 


Max 




256 


224 


312 


172 


Nfatc2 


A/VObuL'iJj 


2313 


3218 


2396 


2542 


Pm1 




173 


281 


329 


305 


Rarg 


M34476 


102 


113 


114 


215 


Re!a 




297 


260 


304 


244 


Sox15 


W53527 


419 


461 


484 


B37 


Ybx1 


MoZom 


643 


459 


472 


495 


2fp162 


Y12838 


671 


734 


720 


992 


Cebpd 


vc4Dnn 
X.OloUU 


157 


262 


168 


430 


Idb2 


M69293 


244 


210 


310 


604 


Jundl 


W29355 


1274 


2002 


1434 


3085 


Lyl1 


X576B7 




342 


347 


B91 


Nfe2 


L09600 


458 


743 


1042 


505 


Nfkbl 


L28117 


953 


2044 


1876 


2034 


Pbxi 


AF0201P6 


611 


303 


345 


212 


Gtpn 


A34693 


, 375 


784 


091 


520 


Ttflb 


U67303 


673 


659 


420 


663 


Trp53 


P10361 


259 


149 


125 


361 


Usf2 


U12283 


120 


185 


285 


4Q? 

toz 


Ybx3 


L35549 


96 


169 


21 G 


1 19 


Zfp216 


AA510137 


82 


151 


204 


106 


Irfl 


M21055 


85 


207 


278 


198 


Klf2 


U25096 


62 


86 


246 


77 


Myb 


Ml 284 B 


692 


356 


230 


435 


Sl?.t3 


AA396029 


484 


1057 


1012 


290 


Tfdpl 


008639 


307 


550 


505 


1093 


Ccbpb 


X626C0 


390 


1248 


1380 


1903 


StraH 


Y07836 


223 


363 


510 


935 


Cebpa 


M62362 


33 


212 


182 


44 


Grg 


X73359 


89 


565 


916 


1005 


Mad 


X831CB 


0 


111 


167 


327 


Myc 


L00039 


314 


112 


62 


173 


Elohi6 


W83R87 


169 


386 


313 


1003 


TBX1 


AAS4222A 


0 


0 


1 


2 



Shown are the transcription factors identified as present by the oligonucleotide array oner/sis whose maximal AD between perfect match end mismatch oligonucleotide 
sets was greater than or equal to 200 U to this study. Data are presented as described In the legend to Table 3. 
AD Indicates average diflerer.ee; gene symbols are expanded in an Appendix at the end of this article. 



peptides identified for a given protein were derived from regions 
along the entire length of the protein, indicating the observed 
products were not the result of proteolytic degradation. These 
data must be considered with several caveats: membrane and 
other hydrophobic proteins and very basic proteins are not well 
displayed by the standard 2DE approach, and proteins present at 
low levels will be missed. 35 In addition, to simplify MS analysis, 
we used a Coomassie dye stain raiher than silver to visualize 
proteins, and this decreased the sensitivity of detection of minor 
proteins. The MS method we used was sufficiently sensitive to 
identify proteins that could barely be visualized by colloidal 
blue stainiiig. However, a limitation of the method for the mouse 
is thai the current database lacks predicted amino acid sequences 
for a substantial fraction of murine genes. In addition, very 
small proteins give only a few peptides, making statistically 
confident identification difficult. 



Figure 5 shows the analytical colloidal blue-stained 2DE IPG 
reference maps of differentiated MPRO cells. Expression patterns 
of more than 500 protein spots were detected and observed through 
the entire series of gels. Protein spots could easily be cross- 
matched to each other, indicating the reproducibility of the method. 
As marked on the gel pictures (Figure 5), 50 proteins with a wide 
range of molecular weights (1 to 200 kd), isoelectric points (4 to 9), 
and abundances were subjected to MS protein identification. The 
results are presented in Table 6. 

Comparing the theoretical value of the molecular weight and pi 
of each prolein to th2t of the observed value, we confidently 
identified 28 proteins in the expected position on the gels (spots I to 
28). Some of the other proteins with strong matches to the murine 
databases migrated to a somewhat unexpected pJ position. Nine 
spots gave clear peptide peaks on mass spectroscopy but did not 
match any known gene. Their identification will require amino acid 
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Figure 5. 2DE ele ctropho ret og rams of MPRO cells. 
MPRO cell rysate (2.5 x 10 6 cell/sample) was loaded for 
2DE analysis. Gels were stoned with btilliartt blue G- col- 
loidal dye. (A) 20E map of uninducod MPRO ceil (0 hour). 
<B) 2DE map of matured MPRO cells (72 hcurs). Protein 
spots marked In the maps were considered differentially 
expressed and were subjected to MS analysis. The 
resuttenl protein information is Bsted in Table 6. 



IPGs 

in 



31 



4$ $.0 $& G.C &3 T>3 VjS fcOfcS 




160 

as 



45 i.0 £.5 &3 fiS ?.0 7.5 a t; 65 
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sequence analysis or availability of more extensive murine data- 
bases. We searched for the expression patterns of the genes cognate 
to the expressed proteins in dbMC (Table 6). Nineteen genes were 
found in dbMC, the mRNA for 5 genes was reported as absent, and 
13 genes were present during MPRO differentiation. Comparison 
of the expression patterns showed only 4 genes of 1 8 present on the 
oligonucleotide chips whose expression was consistent at the RNA 
level and protein level. None of these was on the list of the genes 

Table 6. Correlation of expression patterns between mRNA level and protein level 



that were differentially expressed significantly (5-fold or greater 
change by array or 2-fold or greater change by DD). 



Discussion 

We explored the temporal patterns of gene expression during 
myeloid developmenL A database has been developed to provide a 



Spot 


Protein definition 


Gl number 


Predicted 
value 

kd pi 


Percentage 
(*) 


2DE pattern 
Oh 72 h 


cDNA expression 
pattern 

Oh 72 h 


Ag 


1 


GRP76 


2505545 


72.4 


5.1 




1 


3 


1321 


1043.3 


N 


2 


Atffn, gamma, cytoplasmic 


B752954 


41.77 


5.3 


40 


3 


6 


0 


2 


Y 


3 


RHO GDI 2 


2494703 


22.83 


4.S 


33 


3 


3 


341 


441.6 


Y 


4 


ProWeratrng cell miciear antigen 


7242171 


26.77 


4.7 


42 


1 


0 


544 


430.9 


Y 


5 


APS kinase 


4038346 


69.8 


7.1 


24 


2 


1 


43 


50.7 


N 


6 


f^yruvate kinase 3 


6755074 


57.9 


7.2 


46 


6 


4 


3047 


5880.3 


N 


7 


Melanoma X-actln 


6671509 


41.72 


5.3 


39 


1 


3 


2539 


341.3 


N 


8 


Gryceraidehyde-3-phosphate dehydrogenase 


6573937 


35.79 * 


8.7 


39 


B 


7 


3073 


5742.3 


N 


9 


Stetin 3 


461911 


10.59 


6.9 


48 


0 


4 


N/A 


N/A 




10 


Guanine nucleotide binding protein, beta-2, 






















related sequancel 


6680047 


35.06 


7.9 


21 


4 


2 


139 


303 1 


N 


11 


Triosephosphate isomerase 


6578413 


25.69 


6.9 


26 


3 




3312 


2650.1 


Y 


12 


Testis -derived c-abl prote!n 


1196524 


17.19 


7 


51 


2 


3 


152 


126.9 


N 


13 


RNA binding motif protein 3 


794S121 


16.59 


6.8 


25 


1 


0 


625 


812.4 


N 


14 


Coltapsln response mediator 


6681019 


62.16 


6.4 


36 


2 


0 


Absent 


Absent 


N 


15 


Larrrln A 


22C474 


47.52 


6.6 


35 


2 


0 


Absent 


Absent 


N 


16 


47-kd keratin 


52783 


35.62 


4.8 


29 


3 


0 


Absent 


Absent ' 


N 


17 


sid478p 


5931565 


31.3 


6.7 


3D 


1 


2 


Absent 


Absent 


N 


1B 


MHC class 11 H2-IA-beta-5 


3168662 


28.6 


7.1 


39 


1 


2 


N/A 


N/A 




19 


Andrcgervbiriding protein: subunft alpha 


739346 


8.04 


6.4 


6e 


0 


2 


Absent 


Absent 


N 


20 


Neuronal apoptcsts inhibitory protein 


5932010 


158.7 


6 


17 


1 


0 


N/A 


N/A 




21 


PAD type IV 


6755018 


74.46 


7.2 


21 


1 


3 * 


N/A 


N/A 




22 


Human serum albumin homologcue 


3212625 


65.45 


5.7 


24 


0 


6 


N/A 


N/A 




23 


syncrip 


6576815 


62.53 


7.2 


33 


2 


1 


N.'A 


N/A 




24 


Transamkllnase 


1730203 


48.22 


7.2 


31 


3 


1 


N/A 


N/A 




25 


PGK crier phespf logtycerete 


173G519 


44.54 


6.3 


47 


S 


4 


108S 


1402.3 


N 


26 


Proliferation-associated gene A 


6754976 


22.16 


6.6 


53 


3 


1 


N/A 


N.'A 




27 


Putaluve peroxisomal antioxidant enzyme 


3913065 


17 


7.6 


55 


0 


3 


N/A 


N/A 




28 


IgE chain C2 region 


2137430 


12.1 


s *2 


38 


0 


1 


N/A 


N/A 





The proteins listed here are represented by the spois marked In the electrophoretograms shown in Figure 5. 

Protein definition. Gi number, and predicted value refer to the protein name, accession number, end properties derived from the National Center for Biotechnology 
Infcrroation protein database. The column labeled % shows the percentage of peptides predicted from the protein sequence that were deioctetf by mass spectroscopy. The 
expression level of protein spots expressed in mouse promyelocytic cell line cell Induced by oft-frans retinotc add for 0 Iwurs end 72 hours (Figure 5) were scored on o scale of 
1 (+) to 8 1+ + + -r + + + +) in the 2DE pattern column. The cDNA expression patterns of the cognate mRNAs are listed in the cDN A expression pattern column abstracted from 
the dbMC database. The genes not represented oh the oligonucleotide arrays were marked as N/A Ag showed the correlation of gene patterns a: mRNA level or protein level. 

Y indicates agreemerJ and N discrepancy between changes in cDNA and protein spot intensity. The numbers in bold were obtained with DD. 2DE indicates 2-dimcnsiona! 
gel electrophoresis; IgE, immunoglobulin E; DD, differentia! display. 
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reference for later research on the molecular mechanisms underly- 
ing normal myeloid development. 

The MPRO cell system morphologically mimics normal myeloid 
differentiation and biochemically proceeds further toward mature neutro- 
phils than most other in vitro systems. Because the arrest in differentia- 
tion of MPRO cells growing in the absence of ATRAis not physiologic, 
there is a theoretical risk that gene expression in these ceils is not 
coordinated in the way (hat it is in normal differentiation. It is 
encouraging that, for the most part, the timing of expression of genes for 
proteins of the various neutrophil granules is consistent with the timing 
of the morphologic and biochemical appearance of these granule 
components during normal myeloid differentiation. 

The DD technique provides certain advantages for detecting 
and comparing mRNA levels in different samples. First, the method 
is, in principle, similar to competitive RT-PCR, and, with the use of 
stringent PCR conditions, is expected to be about as reliable. 
Second, display patterns are reproducible. Third, the method 
detects the levels not only of RNAs already represented in the 
database but also of unknown RNA species that may represent 
"new" genes. Fourth, closely related genes can be distinguished 
regardless of cross-hybridization, provided there are some single 
nucleotide differences in the 3' end sequence. Limitations associ- 
ated with this technique are that numerous gels are necessary to get 
complete information and that comparison of the levels of different 
mRNAs is only approximate because of the differential amplifica- 
tion of bands of different size or sequence. 

Oligonucleotide chip analysis is a fast and effective means of 
accessing mRNA expression patterns. 20 Cluster analysis of groups 
of samples by this approach is effective. However, the present 
results indicate that alternative methods of verification are desir- 
able before the data on an unexpected change in a particular gene 
are definitively accepted. 

To obtain the broadest range of information from the myeloid 
differentiation process, both differential display and oligonucleotide 
chip techniques were applied in the current study. As a result, 65.3% of 
the observed changes in mRNA levels came from the differential display 
method and 4 1 .5% came from oligonucleotide chip assays. 

Our data showed in general that changes in expression pattern 
by the 2 methods agreed qualitatively but that there was some 
quantitative variation. Our results indicate that DD may be a more 
accurate way to detec t changes in levels of gene expression than the 
oligonucleotide chip assay. However, improvements in the types of 
oligonucleotides used in arrays may close this gap in the future. 

The mRNAs for a limited number of transcription factors vary in a 
pattern correlating with that of the mRNAs for primary or secondary 
granule proteins. However, more detailed infomiation is needed, and the 
underlying mechanisms of granule gene regulation remain unclear. The 
number of potential positive and negative regulatory factors found here 
is sufficiently small as to make it feasible to perform in vivo studies, 
such as chromatin immunoprecipitation. 

The oligonucleotide chip used in this study focused on known 
genes, whereas the DD method samples all polyadenylated tran- 
scripts. The latter method generated a large number of products not 
associated with known genes, in part because the mouse genome is 
not as well represented in the database as the human genome. 
However, our experience with DD and human mRNAs indicates 
that substantial fractions of the products represented as ESTs or not 
represented nt all in the public databases are cDNA copies from 
inlrons. hnRNA, or other RNA with internal A runs. 

Approximately 59 sequences obtained from gel -display bands 
had significant changes in the level of expression and e sequence 
that did not match that for any named gene in the public databases. 



Of these, 38 had plausible or excellent polyA signals. This is only 
an approximate estimate of the number of new genes found 36 
because a fraction of the mRNAs for known genes still had poor 
polyA signals. In addition, the full 3' untranslated region is often 
not known for characterized genes, and in some cases these new 
genes may prove to be identical to products identified by the 
oligonucleotide chips when more complete sequences are obtained. 
At the least, their presence indicates that a substantial fraction of 
the regulatory or functional circuitry of maturing myeloid cells 
remains unexplored and that valuable tools for their investigation 
will emerge from a combination of RNA expression studies and 
analysis of emerging genomic sequences. 

The desired end point for the description of gene expression in a 
biologic system is not only the analysis of mRNA transcript levels 
but also the accurate measurement of protein abundance. The 
developments in 2DE and new MS instrumentation make it 
possible to accomplish this work rapidly and efficiently. In this 
study, we attempted to identify a number of the proteins differen- 
tially expressed between uninduced and ATRA-differentiated MPRO 
cells and to examine the relation between mRNA and protein expression 
levels for these genes representing the same state. 

For protein levels based on estimated intensity of Coomassie dye 
staining in 2DE, there was poor correlation between changes in mRNA 
levels and estimated protein levels. Other groups have studied the 
correlation between mRNA and protein levels in yeast and liver 
cells.' J- 12 * 14 In the liver cell exTXiriments, 1 '*' 2 correlation coefficients of 
0.4 to less than 0.5 were observed. Tn an extensive study in yeast,"' 12 the 
correlation coefficient was high if the most abundant mRNAs and 
proteins were considered. If a handful ofthese products was omitted, the 
remaining correlation coefficient was 0.4 or less. However, one 
could restore some of the correlation by averaging individual 
data points into broad prot comic categories.- 7 

The discrepancies between mRNA and protein levels in MPRO cells 
appear to be substantially larger than those observed for yeast Possible 
causes for the discrepancies include translational regulation, differential 
expression of certain mRNAs nt various stages of cell growth m vitro, 
post-translational protein modification that varies with the stage of 
maturation of the cells, and selective degradation or excretion of proteins 
in vivo. Furthermore, here we are focusing on a developmental 
time-course, whereas the yeast study concentrated on the organism in 
vegetative growth. New techniques, equipment, and bioinformatic 
analysis tools must be developed to make such systematic, global, and 
quantitative analyses feasible. ' 

The initial studies of protein expression presented here provide a 
cautionary note for efforts to interpret cell composition and function in 
relation to mRNA levels. Discrepancies we observed between gene 
expression and protein abundance suggest that selective post-transcrip- 
tional controls may be at least as important as changes in mRNA levels 
in determining the protein composition of neutrophils and that they are 
phenomena less well explored than transcriptional controL Analysis of 
mRNA expression patterns is itself only a small tegiiining toward a 
genome-wide description of cellular components^ 
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Appendix 

Gene symbols used in tables: Adb: aclin, beta, cytoplasmic; Acig: adin, gamma, 
c>tophisinic; Actx: melanoma X-actin; Aidol: aldolase 1, A isotbma; Mi5: 
ADP^&osylatiott factor 5; AU'l: ai'twating Lrausoiption radw 1; Atl2: acuvaiing 
aansctiption iaaor 2: Bd3: basic transoipuon lactor 3* Bzip: paiphefal-typc 
bcnzodiazcpiiic irccptoi; C5:T: complement companoit 5, receptor IAj protein- 
coupled receptor (C5a% Cmb2: cycHn B2; Cd3612: CD36 antigGn (collagen type 
I receptor, thrombospondfn r«eptor>liVe 2; Cd53: CDS3 anti K en; Cebpa: 
CCAAT/cnhanccr binding protein OBBP, alpha; Ccbpb: CCAAT/cnhanccr 
binding protein (C/EBP), beta: Ccbpd: CCAAT/enhanccr binding protein (C 
EBP), dclu; Ccbpc: CCAAT/mliancer binding protein (C/EBP), cpsifcui; Cfll : 
cotilin 1 , nonniuscie; Cmk2i4: chemokine (C-X-C) rtceptor 4; Cmkbr 1 : chemo- 
kinc (C-Q iweptor 1/NIipla recepion Cnlp: csthclin-Ukc protein; Cutf: ciliaiy 
neurotiopic factor/zinc finger protein PZF; Copa: eoatomcr pvtcin complex 
6vvbunit alpha; Cpa3: caibox>T>cpuda5e A3, mast cell; Cr2: com|)lcment receptor 
I; Crhr: ciirticotroptn releijsmK hormone rctcpUw; Crry: ^implement rcccpUT- 
rdsiud protein; Csnr: CSF I (M-CSF) toccptM/c-fmsiCDI 15; Csllra: CSF 2 
(GM-CSF) receptor, alpha, low-atTmitj'/CD: 16; Csf2rbl: CSF 2 (GM-CSF) 
receptor, beta 2, Uiw-attinity/U . 3 rect^tor-likc protein (AlC2B)/Ci>A'13J; 



Cst2rb2:CSF 2 iGM-CSF) rc:cptor, beta 2, low-affiniry/lL-3 receptor (A!C2AK 
Ctsb: catfaepsin B; Ctsc: cathepsin C; Ctsd: cathepsui l>, Ctsc: cathcpsinE; Osg: 
iradicpsin G; Ctsb: cathepsin rfc Osl: caihepsin L: Qss: cathepsb S; Cybb: 
r/iochrome b-245, beu: Drdl: dopaniine receptor 2; E21T: H2F tjanscription 
factor 1; Ear2: eosuicpbil-associated ribonuclease 2; Ebi3: Epstein-Banr virusr- 
iuduced gene 3/cytokine iecei>tor-4ikc molecule (EB13); FJ2: Balb/c neutrorJiil 
elasta<*; FJa2: elastaw 2; Erh: enhancer of rudrmentary homolog (Drosoprrita); 
Boh:6: cthanol induced 6.'surrol regulatory ^ elcnicntbtndmg trdnscripttcn factor I 
(SREBFI) homolog; F2H2: coagulation factor TI (dirombin) reccptor-likc 2; 
Fcer lg: Fc receptor, IgE, high aHiniry I, ganinia porypeptide; Fcgr2b: Fc receptor, 
IgG, low aliiniy lib: Fcgr3: Fc receptor, IgG, low affinity III; Fprl: ibraryl 
peptide receptor L4MLP lrctptoi; Gabpbi: GA repeat binding protein (GABP- 
bctal sublet); Gata2: GATA-binding protein 2; Gnas: guanine nucleotide 
bmding 'rovtcin. clpha slimulating; Gnb2-rsl : guanine nacicotidc bbding protein, 
bcta-2, relrA«l scqacnee I; Gpx.l: glutathione perrrxidasc 3; Grg: rclafeil to 
Dnvjophila gpmciin gene; Gridl: ghuamate receptor cluuinel subunit ri\:lLi I; 
Gm: granulin: Gslm I : glutaEhione-S-transfda.se. mu I; Gus-s: bcta-gbcuroni- 
dase structural; Gys3: glycogen s>-nl)iase 3, brrtin; H2-D: bistccOxurMlibUity 2, D 
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region locus 1; Hist2: hist&ne gene complex 2; Hist5-2ax: H2Ahistone rarniry, 
member X: Hingi: high mobility group protein I; Hsp60: heat shock protein, 60 
kDa; HtrSa: 5-hydravy try ptnmine (serotonin) receptor 5A; Idbl: inhibitor of 
DNA binding l/helix-loopielix DNA binding protein regulator (Id); Idb2: 
inhibitor of DNA binding 2; Imgn interferon gamma receptor; Ifhgr2: interferon 
gamma receptor 2; Ti: la-associated invariant chain; Ilia: ILl alpha; Illr2: IL1 
receptor, type II; U2rg: IL2 receptor, gamma chain; IWra: IL4 receptor, alpha; 
IllOrb: H.10 receptor, beta; 11 17r. IT M receptor; Irfl: iuterfemn regulatory fector 
1 ; frf2: interferon regulatory fector-2; Itgb2: integrin beta 2 (Cd 18); ItprS : inositol 
l»4^-trisphosphatc receptor (type 2); Jundl: Jun rtfOUHOT^gene-related gene 
dl/transcription factor JUN-D; KlfZ: Kruppel-like factor LKUF; L-CCR: lipopoly- 
saccharide mduc&te C-C cbemokine receptor-related; Lcn2: lipocalin 2; Ldlr: 
fc>w density lipoprotein receptor, Lspl: Lymphocyte-specinc l/S37/pp52; Lstl : 
feiicocyte-specific transcript 1 ; Lth4r. leukotrierte B4 receptor; Lttr. tymphotoxin- 
beta receptor; Ltf: lactotnnsferrin; Ly64: lymphocyte antigen 64; Ly6e: lympho- 
cyte antigen 6 complex, locus E; Lyll : lyrnphoblastomic leukerrua/bHLH fector; 
Lyzs: lysozyme; M6pr: mannose-6-phosphate receptor, cation dependent; Mad: 
Max dimerization protein; Man2cl: mannosidase, alpha, class 2C, member 1; 
Max: Max protein; Maze MYC-associated zinc ringer protein (p urine-binding 
transcription factor); MBP: eosinophil granule major basic protein precursor; 
McptS; mast cell protease 8; Mil: myetoid/rymphoid or rnixed-lineage leukemia;' 
Mmpt3:niatrixmetalloproteinase l3/collagenase;Mmp9: matrix metalloprotein- 
ase 9/gelatinase B; Mpo: myeloperoxidase; Myb: myeloblastosis oncogene; 
Mybl2: myeloblastosis oncogene-hke 2; Myc: myelocytomatosis oncogene; 
Myln: myosin light chain, alkali, nonmuscie; Nfatc2: nttclear fector of activated T 
cells, cytoplasmic Z: Nfe2: nuclear fector, erytnrokt-derived 2, 45 kDa; Nfkhl : 
NF-kappa-B (p 105); Ngp: neutrophiiic granule protein; NMDRGB: N-methyl-D- 
aspartate receptor gliitamate-binding chain homolog; Nprr.l: nudcxiphosmui 1; 
Nr4al : nuclear receptor subfamily 4, group A, member i ; Osi: oxidative stress 
induced; P2rxl: puriuergic receptor P2X, ligand-gated ton channel, I; P2ry2: 
purinergic receptor P2Y, G-protein-coupled 2; P40-8: P40-8, fundbnal/kuninin 
receptor, Phxl: pre B-cell leukemia transcription tactor I; Pic: properdin factor, 
complement; Piral: paired-lg-like receptor AI; Pira5: paired-lg-like receptor 



A5; Pira6: paired-Ig--13ce receptor A6; Pirb: pairedVIg-like receptor B; Plaur: 
urokinase plasminogen activator receptor; PMI: putative receptor protein (SP: 
P17152 ); Pnil: promyelocyte leukernia; Prg: proteoglycan, secretory granule; 
Prg3: proteoglycan 3</eosinophU major basic protein 2; Prtn3: proteinase 3; 
Psma2: proteasome (prosome, macropain) sub unit, alpha type 2; PtmM: protfay- 
mosin beta 4; Ptprc: protein tyrosine phosphatase, receptor type, C: Rac2: 
RAS-reiated C3 botulinum substrate 2; Rarg: retinoic acid receptor, gamma; 
Rela: avian miculoendotheliosh viral (v-rel) oncogene homolog A/NF-kappa-B. 
p65; Rpli9: ribosomal protein L19; RPL8: ribosornal protein L8; Rps6kai: 
ribosomal protein S6 kinase polypeptide I; Rps8: ribosomal protein S8; Rtn3: 
reticulon 3; S100a8: S100 calcium binding protein A8 (calgranulin A); Sl00a9: 
SI 00 calcium-binding protein A9 (calgranulin B); Sdfr2: stromal cell-derived 
fector receptor 2; Sell: selectin L (lymphocyte adhesion molecule I); Sema4d: 
semaphorin 4D; Seppl: fudenoproteiu P, plasma, I; Sfpil: SFFV proviral 
integration I; Shfdgl: split hand/foot deleted gene I; SlclOal: solute carrier 
femiry 10 (sodhnn/bile acid cotransportcr family), member 1; Slpt secretory 
leukocyte protease inhibitor; Soxl5: SRY-box containing gene 15; Spi2-1: serine 
protease inhibitor 2-1; Srbl: scavenger receptor class Bl; Stat3: signal transducer 
and activator of transcription 3; Stat5a; signal transducer and activator of 
transcription 5 A; Stat6: signal transducer and activator oftnmscription 6; Stra 14: 
basic-heux4oop-helix protein-retinoic acid induced; Tbxl: TBX1 proteaVLPS- 
induccd TNF-alpha factor homolog; Tcrgb: T-cell-receptor germline beta-chain 
gene constant region; Tcrg-V4: T-ceU-reoeptor gamma, variable 4; Ictexl: 
t-complex testis expressed t; Tfdpl: tamscription fector Dp i; Tiflb: transcrip- 
tional intermediary factor 1, beta; TIr4: toll-like receptor 4; Tnfrsfla: TNP 
receptor superfamily, member la; Tntrsflb: TNF superfamily. member lb; 
Tbmm70a: trenslocase of outer mitochondrial membrane 70 (yeast) homolog A; 
Tpi: triosephosphate isomerase; Trp53: tramforrnation-related protein 53; Ubb: 
ubiquitin B: Usf2: upstream transcription fector 2; Yhxl: Y box transcription 
factor; Yhx3: Y box binding protein; Zip 1 1-6: zinc finger protein s 1 1 -6; ZtplB: 
zinc finger protein 1 8 homolog; ZfpJ6: zinc finger protein 36; Zfpl62: zinc finger 
protein 162; Zfp216: zinc finger protein 216; Zfpml: zinc finger protein, 
multitype I ; Znfiil al: zinc ringer protein, subfeinUy 1A, I (Ikaros); Zyx: zyxin. 
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Bacterial lipopolysaccharide (LPS) evokes several 
functional responses in the neutrophil that contribute 
to innate immunity. Although certain responses, such as 
adhesion and synthesis of tumor necrosis factor-**, are 
inhibited by pretreatment with an inhibitor of p38 mi- 
togen-activated protein kinase, others, such as actin as- 
sembly, are unaffected. The aim of the present study was 
to investigate the changes in neutrophil gene transcrip- 
tion and protein expression following lipopolysaccha- 
ride exposure and to establish their dependence on p38 
signaling. Microarray analysis indicated expression of 
13% of the 7070 Affymetrix gene set in nonstimulated 
neutrophils, and LPS up-regulation of 100 distinct 
genes, including cytokines and chemokines, signaling 
molecules, and regulators of transcription. Proteomic 
analysis yielded a separate list of up-regulated modula- 
tors of inflammation, signaling molecules, and cytoskel- 
etal proteins. Poor concordance between mRNA tran- 
script and protein expression changes was noted. 
Pretreatment with the p38 inhibitor SB203580 attenu- 
ated 23% of LPS-regulated genes and 18% of LPS-regu- 
lated proteins by S-40%. This study indicates that p38 
plays a selective role in regulation of neutrophil tran- 
scripts and proteins following lipopolysaccharide expo- 
sure, clarifies that several of the effects of lipopolysac- 
charide are post-transcriptional and post-translational, 
and identifies several proteins not previously reported 
to be involved in the innate immune response. 



Lipopolysaccharide (LPS), 1 a component of the outer cell wall 
of Gram-negative bacteria, evokes a variety of functional re- 
sponses in the human neutrophil (PMN) after binding to a 
plasma membrane receptor complex that involves the Toll-like 
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receptors (TLRs) (1-5). These "immediate*' functional re- 
Bponses, including actin assembly, adhesion, uctivation of nu- 
clear factor-kappa B (NF-kB), and priming for an enhanced 
secretory response and for release of reactive oxygen interme- 
diates, appear to be central both to the innate immune re- 
sponse and to the pathogenesis of several inflammatory human 
diseases, including sepsis and the acute respiratory distress 
syndrome (6). p38 mitogeh-activated protein kinase (p38 
MAPK) has been shown to mediate LPS-induced PMN adhe- 
sion, NF-kB activation, and TNF-a and IL-8 translation and 
release (7), and its blockade attenuates LPS-induced PMN 
accumulation in the airspace (8). However, other cascades al- 
most certainly lead to downstream effectors of the LPS signal; 
for example, actin assembly appears to be p38 MAPK-inde- 
pendent (9). An improved understanding of the transcriptional 
and translation^ responses of the neutrophil to LPS and the 
modulation of these responses by p38 MAPK might carry 
pathogenetic and therapeutic implications. 

Historically, it has been believed that the downstream PMN 
transcriptionol response to LPS is static and that PMN func- 
tional responses to LPS that depend on de novo protein syn- 
thesis are primarily limited to the release of cytokines (10). 
However, recent studies indicate a robust transcriptional re- 
sponse (11). To date, most studies have relied upon and re- 
ported a short list of functional assays of the LPS-exposed 
PMN; therefore, no exhaustive investigation of either the tran- 
scriptional response or protein synthetic repertoire of the PMN 
has been reported. Although several techniques huve been used 
to evaluate transcripts, the screening of global changes in 
mRNA by microarray analysis has only recently become possi- 
ble. In this way, thousands of genes can be screened in an 
unbiased fashion for transcript abundance. Such genomic 
screens in mammalian cells have previously been applied to 
define altered expression profiles in response to agonists (12) 
and to drug action (13) and during cell cycle progression (14). 

Although DNA microarray technology is expected to provide 
insight into the response of the human PMN to LPS (15), 
inliibition of LPS-stimulated IL-1 and TNF-<* production by 
p38 MAPK inhibitors in THP-1 cells (16) and of TNF-a synthe- 
sis in human PMNs (9) occurs at a translational level and 
would therefore not be detected by DNA microarrays. Further- 
more, in other systems, such as yeast and human liver, mRNA 
and protein levels show poor correlation (17, 16). Proteomics is 
a complementary tool for assessing global changes in cellular 
protein expression, thereby providing additional insight into 
cellular signal regulation. A proteomic approach has proven 
useful in different systems for dissecting signal transduction 
cascades and describing their output (19, 20) and has even 
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recently been used to detect novel upstream messengers in- 
volved in LPS signal transduction (21). We have applied DNA 
microarrays and proteomics to define and compare transcrip- 
tional and post^transcriptional alterations in the LPS-exposed 
PMN and to establish the dependence of these alterations on 
p38 MAPK signaling. 

KXPE RiMENTAL PROCEDURES 

Materials — Endotoxin-free reagents and plastics wore used in all 
experiments. Aprotinin, leupeptin, AEBSF, E-64, pepstatin, andbosta- 
tin protease inhibitors, spermine HC1, and a-cyano-4-hydroxycinnamic 
acid (CHOA) were all purchased from Sigma Chemical Co. (St. Louis. 
MO). SB203580, a p38 MAPK inhibitor, was purchased from Calbio^ 
chcm-Novabiochem Corp. (San Diego, CA). For two-dimensional PAGE, 
rehydration buffer, equilibration buffers, vertical electrophoresis solu- 
tions, and 10% homogeneous polyacryl amide slab gels were purchased 
from Genomic Solutions, Inc. (GSI, Ann Arbor, MI). Sequencing grade 
porcine trypsin was purchased from Promega (Madison, WI). 

LPS Incubation — PMNs were isolated by the plasma Percoll method 
(22), a technique Dial yields less than 6% monocytic contamination, and 
resuspended at a concentration of 15.4 X 10°/ml in RPMI 1640 culture 
medium (BioWhittaker, Walkersville, MD) supplemented with 10 mM 
HEPES (pH 7.6) and 1% heat-inactivated platelet-poor plasma. After 
addition of 100 ng/ml Escherichia coli 0111:B4 LPS (List Biological), 
incubation was carried out with continuous rotation (4 h, 37 *€) both in 
the presence and absence of SB203580, Both Asymetrix analysis and 
proteomic analysis utilized 75 X 10 6 cells. For microarray analysis, 
nonstimulated and 4-h-treated PMNs were collected from three sepa- 
rate donors. A more detailed time course following LPS exposure was 
performed using polymerase chain reaction. For proteomic analysis, 
LPS incubations from separate donors (n = 6) were performed and then 
analyzed individually. Control and poat^LPS incubation PMNs were 
washed (0.34 M sucrose/1 mM EDTA/10 mM Tris) and then tysed in a 
modified rehydration buffer (GSI, Ann Arbor, MI) supplemented with 2 
M thiourea, GO mM dilhiothreitol (DTD, 22.5 mM spermine HC1, and a 
mixture of six protease inhibitors (10 /ig/rnl aprotinin, 10 ;tg'ml leupep- 
tin, 2 mM AEBSF, 5 fxH E-64, 1 pepstatin, 10 um bestatin). DNA was 
pelleted by centrifugation at 250,000 X g for 60 min (23). 

Affymetrix Oligonucleotide Array — Five micrograms of total RNA was 
isolated with TRIzol (Invitrogen) and RNeaay columns (Qiagen) and 
subsequently labeled with biotin as described by Asymetrix. Briefly, 
first-strand synthesis was accomplished with Superscript II reverse tran- 
scriptase (Invitrogen) using a T7-oJigo( dT^ primer for 1 h at 42 e C 
followed by second-strand synthesis using & coli DNA polymerase I and 
RNase H (Invitrogen) at 16 °C for 2 h. Double-stranded DNA was used as 
a template for in vitro trans cription with 17 RNA polymerase in the 
preHence of biotin-laheled UTP and CTP using the BioArray High Yield 
RNA transcript labeling kit QStuso). Fifteen micrograms of cRNA was 
fragmented and used for hybridization to Affyinetrix HuGene 6800FL 
Genechipa. Each sample was hybridized initially using a Test2 Genechip 
to test for sample degradation and full-length in vitro translation. Data 
were analyzed using Asymetrix Genechip software. Results from three 
separate donors were analyzed. 

Reverse Transcription and Polymerase Chain Reaction — cDNA was 
prepared by reverse transcription using 2 /ig total RNA, derived from 
20 X JO* cells that were treated as indicated. Polymerase chain reac- 
tions were performed using specific primers for Mx-1, TNF-a, MCP-1, 
p65, SW0A4 t and glyceraldelryde-S-phosphate dehydrogenase. 

Two-dimensional PAGE— The protein concentration of the lysates 
was measured as described by Bradford et at (24). Poor isoelectric 
focusing (IEF) results wore encountered unless the polycationic sperm- 
ine was diluted (data not shown); therefore, lysates were diluted with 
rehydration buffer (GSI, Ann Arbor, MI) to achieve a final spermine 
concentration of 6 mM. Equal protein loads (1.5 mg) of control and 
LPS-stimulated neutrophils were used to rehydrate IEF gels overnight 
(18 cm, pH 3-10 nonlinear Immobiline DryStrip IEF gels, Amersham 
Biosciences; Piscataway. NJ). IEF was performed at 20 °C to 100-kVh 
(Phaser, GSI) under mineral oil, followed by two 10-min SDS equilibra- 
tion steps (DTT and then iodoacoUomide-conteining equilibration huff- 
erg, GSI) and then by vertical electrophoreais on 10% homogeneous 
polyacrylamidc fdab gels (GSI) fit 500 V. Protein spots were visualized 
by agitation in colloidal Coomassie Brilliant Blue G-250 (16 h) (25), 
followed by destaining in deionized water (20 h). In separate experi- 
ments, control ond LPS-stimulatod PMN lysates from three donors 
were pooled and then analyzed by two-dimensional PAGE using over- 
lapping narrow isoelectric point (pi) ranges (1.8 cm, pH 5.0-6.0, 5.5- 



6,7, and 6-11, Axneraham Biosciences, Piscatewsy, NJ). Identical IEF 
and vertical electrophoresis parameters were used for all gels. 

Image Analysis of Two-dimensional Gals — Colloidal Coocnassie- 
Estsined gels were digitized using a Powerlook II (UMAX Data Systems, 
Inc., Taiwan) flatbed scanner with 8-bit dynamic range and 150-dpi 
resolution. Biolm&ge (GSI, Ann Arbor, MI) 2D-Analyzer software was 
used to locate, quantitate, and match protein spots on the control and 
LPS gel images. Analysis was performed by assigning 60 common 
anchor epots between paired images; the remaining spots were com- 
pared by a constellation- matching algorithm. All data were then care- 
fully reviewed by the operator to account for any discrepancies. Protein 
loading between control and experimental gela may have varied be- 
cause of inconsistencies in rehydration of the different IEF gel strips; 
therefore, gel images were normalized so that the sum of the integrated 
intensities of all matched spots on paired gels was made equaL Control 
and LPS-stimulated gel images from individual donor experiments 
were matched to generate composite images; composite images were 
then matched into a master composite image to track the LPS response 
of protein spots among different donors (26). Only those spots that were 
common (image-matched) to all original 12 (pH 3.0-10-0) gels were 
considered for further analysis. For these spots, the LPS-iuduced 
change in integrated intensity in the six experiments was subjected to 
statistical analysis with a two-tailed Student's t test, and those spots 
withp < 0.05 were identified b}' peptide mass fingerprinting (described 
below). For the narrow range (pH 6.0-6.0, 5.6-6.7, and 6-11) two- 
dimensional PAGE experiments using pooled donors, only those spots 
with concordant regulation exceeding 1.5-fold or that appeared de novo 
in the LPS gel in two repeat experiments were further analyzed. 

In~gel Tryptic Digestion—ln-gcl digestion of protein spots waa per- 
formed with sequencing grade porcine-modified trypsin using the 
method of Hellman et al. (27). Tryptic peptides were then extracted (50 
fj\ of 50% acetonitrile/5% trifluoroacetic acid, 2 h), and the supernatant 
was taken to dryness in a vacuum centrifuge and then redissolved in 
trifluoroacetic acid (20 pi, 0.6%). Peptides were then purified and con- 
centrated using ZipTip Ci8 pipette tips (Milliporc, Bedford, MA). 

MALDI-TOF Afass Spectronietry — Analyses were performed on an 
Applied Biosystems matrix-assisted laser desorption ionization time-of- 
flight (MALDI-TOF) Voyager-DE PRO mass spectrometer (Framing- 
ham, MA) operated in delayed extraction mode. Samples (0.5 /d) were 
spotted onto a sample plate to which matrix (0.5 juJ of 10 mgfanl CHCA) 
was added. The sample-matrix mixture waa dried at room temperature 
and then analyzed in reflector mode. CHCA was also spotted alone an a 
negative control. Spectra were the sum of 100 laser shots, and those 
peaks with a signal-to-noise ratio of greater, than 3:1 were selected for 
data base searching. Spectra were internally calibrated using autolytic 
trypsin peptides (m/z 842.61, 2211.10). 

Data Base Searching Algorithm — The monoisotopic masses for each 
protonated peptide were: (a) entered into the program MS-Fit (available 
at prospector.ucsf.edu) for searches against the Swiss-Prot, NCBI, 
and GenPept databases, and (ft) entered into Mascot (avaOable at 
matrixscience.com), an algorithm testing statistical significance of pep- 
tide mass nngerprinting identifications. For MS-Fit searches, masses 
derived from trypsin, CHCA, keratin, and Coo may si e Brilliant Blue 
G-250 were excluded. Search parameters included a maximum allowed 
peptide mass error of 0.1 Da (0.8 Da in the few instances in which linear 
mode was used), consideration of one incomplete cleavage per peptide, 
pi range of 3.0-10.0, and molecular mass range of 1—200 kDa Accepted 
modifications included carbamidomethylation of cysteine residues 
(from iodoacetamide exposure following IEF) (28) and methionine oxi- 
dation, a common modification occurring during SDS-PAGE (29). Pro- 
tein identifications were assigned when three criteria were met; 1) 
statistical significance {p < 0.05) of the match when tested by Mascot 
(matrixscience.com); 2) >20% sequence coverage by the tryptic pep- 
tides; and 3) concordance (±16%) with the molecular weight and pi of 
the parent two-dimensional PAGE protein spot. The following special 
exceptions were considered: (a) protein identifications not fulfilling 
criterion 2 were still assigned if criteria 1 and 3 were fulfilled and no 
other Homo sapiens proteins with peptide mass-matched p values < 
0.05 were identified by Mascot; (b) if criterion 3 was not fulfilled (lower 
than expected molecular weight), a cleavage product of the identified 
protein wa3 inferred, and the cumulative molecular weight of the tryptic 
peptides was compared with that of the two<hmenaional-PAGE spot to 
ensure that it was not exceeded; (c) if criterion 3 was not fulfilled (isolated 
discordance between theoretical and observed pi), post- translation al mod- 
ification of an imrocovered peptide was inferred; and id) if two or moreH. 
sapiens protein assignments with >4 mutually exclusive matching pep- 
tides wore identified, a protein mixture in the two-dimensional PAGE 
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spot was inferred aiul farther analysis halted (quantitative conduaioiis 
regarding the individual protein constituents could not be drawn). 

RESULTS 

Genes Differentially Expressed in LPS-stimulated Neutro- 
phils— Human PMNs were left untreated or incubated in the 
presence of 100 ng/ml LPS for 4 h. As a control to confirm that 
the PMNs were quiescent at baseline and that LPS resulted in 
normal stimulation, mRNA was isolated, cDNA was prepared, 
and PGR for TNF-a was performed. Little TNF-a expression 
was seen in nonstimulated cells, whereas LPS treatment led to 
an increase in expression in each of the donors subsequently 
used for microarray analysis (data not shown). No macrophage- 
colony stimulating factor receptor transcript was detected by 
oligonucleotide microarray analysis, confirming there was no 
significant monocytic contamination. 

Human PMNs express a limited repertoire of mRNA tran- 
scripts at baseline but respond to LPS with differential expres- 
sion of genes in many families. Considering only those genes 
present by microarray analysis in all three donors, unstimu- 
lated PMNs expressed 13.0% (923 of 7070 genes) of the Af- 
fymetrix gene set. Gene classes represented at baseline include 
metabolic enzymes, structural proteins, receptors, signaling 
proteins, and transcription factors. By comparison, human 
monocytes expressed —40% and human fibroblasts —35% of the 
represented genes (data not shown). By the criterion of a >3- 
fold increase in expression in all three donors on Affymetrix 
oligonucleotide array analysis, exposure of PMNs to LPS for 4 h 
resulted in the up-regulation of 100 genes (Table I). 

Genes from several differen t functional classes were induced 
in PMNs following LPS exposure. Of interest, a number of 
transcriptional regulators were induced, including transcrip- 
tion factors of the NF-kB family. The transcriptional NF-kB 
complex has previously been implicated in the regulation of the 
genes induced by LPS (11). The genes for several cytokines and 
chemokines were also found to be up-regulated. These include 
TNF-a, IL-1&, IL-6, MCP-1, MIP-3a, nnd MIP-lp (Table D. 
PGR was performed to confirm the results from the microarray 
analysis. PGR analysis on selected genes indicates that the 
time course for changes can be rapid or delayed but parallel the 
changes found in the array at the 4rh time point (data not 
shown). Other up-regulated genes included those for metabolic 
enzymes, immune response molecules, kinases, phosphatases, 
signaling molecules, adhesion and cytoskeletal components, 
interferon-stiinulated genes, and those with unknown or mis- 
cellaneous function (Table I). 

LPS stimulation of PMN also resulted in the' down-regula- 
tion of 56 genes (Table II). Down-regulated genes were identi- 
fied as transcriptional regulators, protein and lipid kinases and 
phosphatases, structural molecules, and signaling molecules. 
Genes for metabolic proteins were also evident, as were several 
uncharacterized genes. 

Two-dimensional PAGE and Image Analysis — In contrast to 
the limited number of transcripts found at baseline, PMNs 
were found to express a large number and variety of proteins in 
the non stimulated state (Fig. 1, A and C, and Tobies III-V). 
Reproducible protein expression patterns were found on the pH 
3.0-10.0 gels, and the majority of proteins fell in the pH 5.0- 
7.0 range (Fig. 1A). The basic region (pH > 7.0) consistently 
exhibited poor resolution, precluding meaningful image analy- 
sis and further workup (data not shown). Depending on the 
spot-finding parameters (minimum spot intensity, filter width) 
selected on the image onalysis software, spot-by-spot manual 
editing was found to be necessary to avoid over- and underde- 
tected spots; moreover, further manual editing was performed 
to screen for unmatched and mismatched spots following 
matching of paired control and LPS-stimulated gels. After spot 



editing, —1200 well-resolved spots were evident on each pH 
3.0-10.0 gel. In an attempt to improve resolution of the pi 
range bearing the greatest number of well-reeolved spots, over- 
lapping narrow pH range gels (pH 5.0-6.0, 5.5-6.7, 6-11) were 
also run. Of interest, a similar number of well-resolved spots 
(~1200) were detected on the narrow pH range gels (Fig. 1, C 
and D). Assuming a detection limit for Goomassie of 15 ng (0.25 
pmol, or 1.5 X 10 11 molecules, for a 60-kDa protein) and a 
protein load per gel corresponding to 75 X 10 6 PMNs, we 
estimate a detection limit on our gels of 2000 molecules/cell for 
a 60-kDa protein. As investigators have suggested in other cell 
lines with the use of high resolution two-dimensional-PAGE 
methods (30), we estimate that > 10,000 proteins are expressed 
in the resting PMN. 

Human PMNs respond to LPS with the differential expres- 
sion of a large number of proteins. In the six individual pH 
3.0-10.0 experiments, the number of protein spots that in- 
creased in integrated intensity by at least'50% following LPS 
exposure was 185, 122, 104, 104, 96, and 131, respectively. The 
number of protein spots that decreased by at least 50% follow- 
ing LPS exposure was 72, 151, 102, 98, 128, and 97, respec- 
tively. Although gel-to-gel regional variability in resolution was 
expected to account for individual spots not being well visual- 
ized on particular gels, only those spots that were matched to 
all. 12 original gels were analyzed further. Overall, the number 
of spots matched to all 12 original gels was 125. The numbers 
of spots that were both matched to all 12 original gels and that, 
increased by at least 50% in integrated intensity in the indi- 
vidual experiments following LPS exposure were 46, 13, 17, 27, 
22, and 20, respectively. The numbers of spots that were 
matched to all 12 gels and that decreased by at least 50% were 
6, 22, 17, 22, 34, and 28, respectively. The LPS-induced change 
in integrated intensity of the 125 spots that were mat ched to all 
12 original gels was subjected to statistical analysis with a 
two-tailed Student's t test, and those spots with statistically 
significant <p < 0.05) regulation among the six experiments 
were identified by peptide mass fingerprinting (Tahle III). 

Identification of LPS regulated Proteins — Several proteins 
were consistently up-regulated on the pH 3.0-10.0 gels (Table 
HI), including regulators of inflammation (annexin HI) and 
signaling molecules (Rab-GDP dissociation inhibitor 0). Sev- 
eral actin fragments were seen to be consistently up-regulated 
in the six experiments following LPS exposure (Table HI). Of 
interest, the proteasome 0 chain was also consistently up- 
regulated. Down-regulated proteins included other signaling 
molecules, such as Rho GTPase activating protein 1. 

On the pH 5.0-6.0 and 5.5-6.7 gels, several proteins were 
found to show increases of greater than l:5~fold following LPS 
exposure (Tables IV and V), including cytoskeletal proteins, 
such as moesin, nonmuscle myosin heavy chain, and a putative 
phosphorylated form of nonmuscle myosin heavy chain, and 
signaling molecules, such as protein phosphatase 1 and P0 4 - 
stathmin. Hie putative phosphorylated form of nonmuscle my- 
osin heavy chain (spot #1101) was positioned 0.03 pH unit more 
acidic than the unmodified protein (spot # 1102) (Fig. ID) and 
was distinguished by a tryptic peptide (mlz 1366.74) not pres- 
ent in the unmodified protein, consistent with phosphorylation 
of serine 685. Serine 685 is predicted by NetPhos 2.0 Prediction 
Server (available at www.cbs.dtu.dk/services/NetPhos/(31)) to 
be a lugh probability phosphorylation residue and by Scan- 
Prosite { www .expasy.ch/tooWscn psite.html) to be a substrate 
for protein kinase G. The tryptic phosphopeptide identified in 
PO^-stathmm, extending from residues 15 to 27 (1468.7 Da), is 
consistent with phosphorylation of either serine 16, a known 
substrate for Ca 2+ /calmodulin (CaM)-dependent kinases (32), 
or serine 25, a known suhstrate for p385 and ERK (Fig. 2A) 
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Table I 

Human neutrophil genes induced after 4 k of LPS exposure 



Description GenBunk™ no. Change- fold 



Transcriptional regulation 

Pleomorphic adenoma gene -like 2 D83784 16.8 

NFKB2 S76638 12.3 

NFKBJE U91616 11.5 

P 65 U90G7 8.4 

BCL3 U05681 7.7 

X box binding protein 1 M31627 7.5 

Metal-regulatory transcription factor 1 X78710 7.4 

Ets2 J04102 7.4 

c-Rel X75042 6.2 

NFKB1 M58G03 5.8 

Basic leucine zipper transcription factor, ATF-like U15460 4.7 

1KB M69043 3.8 

MAX dimerization protein L0G895 3.6 

DIF2 S81914 3.1 

Cytokines and receptors 

MCP-1 M69203 78.7 

MIP-Tfl M72885 48.8 

' aHelix coiled-boil rod homolog AF014958 ' 20.8 

IL-lfl X045O0 17.6 

GR03 (beta) M57731 17.3 

TNF-a X02910 14.5 

M1P3« U64197 8.1 

ILJORA U00672 7.3 

IL-6 Y00081 6.3 

GROa X54489 4 

HM74 D10923 3.8 

Immune response 

Orosomucoid X02544 20.2 

Complement component C3 K02765 12.8 

Protease inhibitor 9 U71364 9.5 

Complement component 3a receptor 1 U28488 6.1 

Pro/cose inhibitor 3 L10343 4.9 

S/JWantileukoprotease X04470 4.7 

BMN/12/clastase inhibitor M93056 4.6 

CD58 Y00636 3.8 

Complement component PFC M83652 3.5 

Kinases 

CNKI FNKf PLK-like U56998 16.2 

Cot 014497 11.9 

Pim~2 U77735 9.5 

LIMK2 D45906 4.3 

Phosphatases 

PAC-1 /DUSP2 L11329 11.8 

DUSP5 U15932 5.3 

VHAt U734.77 3.4 

Signaling molecules 

TNFAIPUA1Q M59465 10 

TRAF1 U19261 6.2 

RanBP2 D42063 5.6 

GNA15 M63904 5.2 

PTAFR D10202 3.9 



Adhesion and cvtoskeleton 
ICAM1 

CEACAMl (bilary glycoprotein) 
UMS1 

SATVactin bundling protein 

Galectin-1 ILGALS1 

MEMD fALCAM 

CD44 

TSG6 



Metabolic 



OTP cyclohydrvlase I 


U19523 


13.5 . 


.VIJUFV^/ubiquinone reductase 


M22538 


8.6 


PSM/ioVlprotcosome iota) 


X59417 


8.4 


I; DF-g» lactose transporter (SLC35A2) 


084454 


7.3 


PLAU (urokinase) 


X02419 


6.4 


in Wt//li-kYnure.r.me hydrolase 


U57721 


5.5 


AMPD3 


D12775 


5 


P4HA J/prolyl 4-hydroxylase 


M24486 


4.7 


y Glutamylcvsteine synthetase 


L35546 


4.5 


ATP6D 


J05682 


4.2 


ATP6SJ 


01.6469 


4 



M24283 22.4 

X16354 6.3 

U09284 6.1 

U03057 5.9 

M57710 4.7 

U30999 4.2 

HG2981— HT3125 3.9 

M31165 3.7 
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Tabu; I — continued 



Description GenBank™* no. Change-fold 



Glycerol kinase K68285 3.6 

FACIA L09229 3.5 

AK3 1 X60673 3.3 

Interferon -inducible 

ISG15 M13755 22.5 

Mxl M33882 19.4 

IFI5G M24594 12.1 

iNDO M34455 6.2 

GBPI M55542 4.3 

PKKH U50643 3.7 

TFJT4 U52513 8.6 

XF154 M14GC0 3.5 

IFI58 U34605 3.5 

IFP35 U72882 3 

Other 

Gos2 M72885 48.8 

MIHCI1AP1 U37546 7.2 

KIAA0W5 D14661 6.1 

R2AA0118 D42087 5 

SNAP23 U55936 5 

CASP5 U28015 4.8 

IOAA01I3 D30755 4.8 

KIAA0255 DS7444 4.7 

Hepatoma-derived GF D16431 4.7 

PTGS2 D28235 4.6 

CD48 M37766 4.3 

UNC129 komolog U40998 4.2 

KIAA015J D63485 3.9 

Rablb XM035G60 3.8 

Annexin VII J 04543 3.7 

KIAA0110 D14811 3.7 

Adrenomedullin 0 14874 3.7 

AIM1 U83115 3.6 

KIAA0250 D87437 3.2 

P5-1 L06I75 S.2 

Scavenger receptor expressed by endothelial cells D63483 3.2 

VHL L15409 3.1 



(33). Assuming that no other multiply phosphorylated stath- 
min species had escaped detection, analysis of the integrated 
intensities of the P0 4 -stathniin and stathmin spots indicates 
that the percentage of the P0 4 form of total cellular stathmin 
increased from 11% to 38% with LPS stimulation (Fig. 2B). 
This is similar to a previous report of an increase from <10% to 
35-40% of the Ser £6 -phosphorylated form in Jurkat cellB stim- 
ulated with anti-CD3 (34). 

Effect of SB203580 on LPS-stimulatcd Gene Expression— 
Gene expression analysis of PMNs stimulated with LPS indi- 
cated that the majority of genes induced by LPS were unaf- 
fected by prior treatment of PMN with SB203580. Of the 100 
genes iip-regulated by LPS, the up-regulation of 23 was inhib- 
ited by greater than 40% (Table VI). The majority of these 
genes affected by SB203580 were inhibited by less than 60% ; 
whereas only six were inhibited by greater than 60%, all of 
which represent previously identified interferon-stiniulated 
genes. Induction of cytokine genes by LPS, with the exception 
of IL 6 r was generally unaffected by SB203580. 

Effect of SB2035S0 on LPS- stimulated Protein Expression — 
Similar to the effect of SB203580 on LPS-stimulated gene 
expression, little effect of SB203580 was seen on expression 
levels for the majority of LPS-reguIated proteins (Table VII). 
Two exceptions are annexin III and a-enolase, for which LPS- 
etirnulated expression was attenuated in the presence of the 
p38 MAPK inhibitor. 

Comparison of Microarray and. Proteomics Results — Of the 
LPS-regulated proteins identified by peptide mass fingerprint- 
ing for which probes were present on the oligonucleotide mi- 
croarray, poor concordance was found at the niRNA level (Table 
VIII). For 13 LPS- up- regulated proteins, 2 corresponding 



mRNA transcripts were up-regulated, 1 was down-regulated, 5 
were unchanged, and 5 were not detected by the Affymetrix 
chip. For 5 down-regulated proteins, 3 corresponding tran- 
scripts were down-regulated, 1 was unchanged, and 1 was not 
detected. Varying patterns of LPS regulation emerge for those 
candidates detected at both the transcript and protein level. 
Proteasome 0 chain was up-regulated at both the transcript 
and protein levels (Table Vlll), with no notable effect of 
SB203580 on expression at either level. Similarly, CAPl, Rho- 
GAP1, and ficolin 1 were down-regulated at both the roRNA 
transcript and protein level (Table VIII), with no notable effect 
of SB203580. Annexin HI was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Tables VII 
and VHI). 

DISCUSSION 

Interaction of bacterial LPS with the human PMN repre- . 
eents a model system for studying the activation and output of 
the innate immune system during infection and inflammation. 
A recent publication (35) describes the gene expression changes 
of a cultured monocytic cell line after infection by the Gram- 
positive bacterium Listeria monocytogenes. The cell wall com- 
ponents of Gram-positive bacteria, like Gram-negative-derived 
LPS (i.e. from E. coli), are known to signal through TLRs (36 ( 
37). Importantly, many of the expression change found in 
LPS-stimulated PMNs in the present study were also described 
in the bacteria-exposed monocytic cells, indicating that many of 
the gene expression changes seen in bacterial infection are 
likely mediated by TLRs (38, 39) and that the LPS model 
system accurately reflects exposure of immune cells to infee- 
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Tabus IT 

Human neutrophil genes repressed (>4-fold) after 4 h of LPS exposure 



Description 



GenBimk™ no. 



Change 



Kinases 
CAMK t II, gamma 
Diacylglycerol kinase, delta 
PRKCL2/PKK2 protein kinase C-like 2 
MAPKAPIC3 

Protein kinase Ht31, cAMP-dependent 
CAMKII 

Trans porters 
$LC25A5f&o\ute carrier family 25, member 5 
SLC19A1; folate transporter 
SLC2A3; facilitated glucose transporter 

Metabolic 

Carbonic anhydrase IV 
KNase A family, k6 
Glycogen phosphorylase; liver 
Inositol polyphospkate-6-phosphatase 
Inositol l,3,4~trisphospkate 5/6-kinase 
Transketolase 

Protein phosphatase 4 t reg. subunit I (clone 23840) 

Cytidine deaminase 

MGAT1 

HMOXl 

MAN2A2 

Glycogenin (also represents U31525) 

Structural 
Fibrinogen-like protein (pT49 protein) 
H2AFZ 
PaxUlin 
Lamin BR 
Dynamin 2 
Actinin 1 
a-TUbulin 

Tubulin, al, isoform 44 

Transcriptional regulators 

Lymplwhlastic leukemia-derived stiqutnce J 
MAX-interacting protein 1 
Nuclear factor crythroid 2 isoform f 
Transducer of ERBB2, } 
NFATC4 

ATF-2 (CRE-Bpa) 

Receptors 
Lympkotoxin p receptor 
Folate receptor 3 (gamma) 



Signaling 
Fix a; cool-2 (KIAA0006) 
AHHBMiioB 
TNFSF10; TRAIL 

Ca 2+ binding 
ANXII 
S100A4 
ANX1 

Other 

Protealipid protein 2 

Protein phosphatase I, a catalytic subimit 

TIMP2 

KIAA0199 

Lipin 2 (KIAA0249) 

CUGBP2 
Clone 23933 
PECAM1 

Delia sleep-inducing peptide 

MGeorge syntL critical region gene. 2 IKTAA0163) 
SBLPLG: CD162; selectin P ligand 



U50360 
D63479 
U33052 
U09578 

HG2167-HT2237 
1,07044 



J02683 
U17566 
M20681 



L10955 
U64998 
M14636 
U57650 
U51336 
L12711 
U79267 
L27943 
M55621 
X06985 
1.28821 

HG4334-HT4604 

Z3G531 
M37583 
U14588 
L25931 
L36983 
M95178 
X01703 

HG225&-HT2348 



M22633 
L07648 
S77763 
D3B305 
L41067 
L0551.5 



-ft>ld 

-4 

-4.2 

-4.3 

-6.3 

-8 

-9.8 



-4.2 
-4.4 

-5 



-4.4 
-4.5 
-4.6 
-4.6 
-4.7 
-.4.8 
-4.9 
-5.4 
-5.4 
-5.4 
-6.8 
-5.9 



-4.2 
-4.7 
-4.9 
-5.9 
-6.2 
-6.7 
-10 
-15 



-4.4 
-4.5 
-6 
-6.9 
-7.8 
9.6 



L04270 


-4.4 


U08471 


-5 


U11875 


-5.3 


D25304 


-4.5 


M12I74 


-4.5 


U37518 


-6.6 


L19605 


-4.3 


M80563 


-4.8 


X05908 


-4.8 


L09604 


-4.9 


HG1614-HT1614 


-5 


M32304 


-5.1 


D83782 


-5.2 


D87436 


-5.6 


U 10485 


-5.8 


U69546 


-6.9 


U79273 


-7 


L34657 


-8 


Z50781 


-8.7 


D79985 


-9 


U2595G 


-32 



tion. Nevertheless, the reliance upon DNA microarrays alone 
affords insight only into the transcriptional response without 
corroboration at the protein level. In the present study, appli- 



cation of both DNA microarray and proteomics technology to 
our model system provides unique insight into both the cellular 
biology of the activated PMN and the responsiveness and reg- 
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FrG. 1. Two-dimensional PAGE of LPS -exposed human PMNs. 
A and B, colloidal Coomassie Blue-etained pH 3.0-10.0, tsvo-dimen- 
sional PAGE gek (A, control; B, LPS- exposed) with up-regulated {solid 
arrows) and down-regulated (hatched arrows) proteins indicated. These 
results are representative of six separate experiments. C and Z), colloi- 
dal CoomasBie Blue-etaincd pH 6.0-6.0, two-dimensional PAGE gels 
(C, control; D t LPS-exposed) with up-regulated (solid arrows), new 
(solid arrou.\ open arrowhead) > and down- regulated (hatched arrows) 
proteins indicated. LPS-orposed PMNs from three blood donors were 
pooled. 

illation of its transcriptional and translational machinery- As 
will be discussed below, our study identifies, in particular, 
novel aspects of the LPS-stimulated PMN transcriptional reg- 
ulation, activity in the innate immune response, signaling, 
cytoskeletal reorganization, and priming for granule release. 

In the present study, the increase in NF-kB transcript abun- 
dance (Table I), detected by the microarrays corroborates the 
findings of other studies of PMNs and monocytes (40) and 
indicates a mechanism for the responsiveness and scope of the 
PMN transcriptional machinery following LPS exposure. NF- 
kB, recently described to be activated by LPS through the 
TLR/MyD88/interleukin-l receptor-associated kinase pathway 
(1, 4), is the only transcriptional complex reported to be in- 
duced by LPS in the PMN. However, because the transcrip- 
tional NF-kB complex has been implicated in the regulation of 
only a portion of the genes induced by LPS in this study (data 
not shown), the importance of alternative transcriptional reg- 
ulators in the PMN is clear. Of interest, several other known 
and putative transcriptional regulators with less well defined 
functions were also up-regulated in the present study, includ- 
ing PLAGL2, n putative zinc-finger protein, XBP-1, MTF-1, 
Ets-2, B-ATF, and DIF-2. On the other hand, LPS-down-regu- 
lated genes include ATF-2 (a known target of p38), NFATC4, 
TOB-1, NF-E2. MXI-1, and LYL-1. Although the exact role of 
these gene products in regulating cell function is unknown, 



these data indicate that the range of transcriptional responses 
in the LPS-stimulated PMN is much broader than previously 
suggested and that the signaling capabilities of the PMN in the 
immune response are thereby likely extended in scope and 
specificity. 

As expected from the literature, the genes for several cyto- 
kines and chemokines, including IL-1& IL G, and MIP-lp, were 
found to be up-regulated (Table I). On the other hand, the 
notable absence of up-regulated cytokines in the proteomics 
experiments reflects their removal in the post-LPS incubation 
wash performed prior to lysis for two-dimensional-PAGE. Up- 
regulation of these inflammatory mediators is well documented 
in PMNs exposed to LPS and in animal models of LPS-induced 
sepsis syndrome and acute respiratory distress syndrome, a 
PMN-mediated illness (41, 42). Several genes in this family 
were up-regulated that have not, to our knowledge, been de- 
scribed in LPS-stimulated cells, including MCP-1, GR03 t 
IL-10RA, and HM74, an orphan G protein-coupled receptor 
with homology to chemokinc receptors. The down-regulation of 
TNFSF10, lyrnphotoxin b receptor, and TNFAIP1 were also 
observed. The modulation of genes involved in cytokine signal- 
ing, including the adapter molecules TRAF1 (LPS and TNF 
receptor signaling) and TNFAIP1 {TNF receptor signaling) and 
several kinases and phosphatases, may indicate a change in 
cytokine responsiveness after LPS treatment. Relevant in this 
regard from the proteomics data are: 1) the up-regulation of 
protein phosphatase 1, which has been shown to regulate PMN 
NADPH oxidase activation and translocation (43, 44) and to 
regulate LPS-induced NF-kB activation (45); 2) the down-reg- 
ulation of Rho-GAPl, which has been shown to regulate 
NADPH oxidase activity in the PMN (46); and 3) the up. 
regulation of P0 4 -stathmin (Table IV), a phosphoprotein pos- 
tulated to function as a re layer and integrator of multiple 
signal transduction pathways (34). Several non cytokine, 
nonchemokine genes involved in the immune response were 
also up-regulated, including the complement pathway mem- 
bers C5, C3ARL and PFC\ the protease inhibitors ELANH2 
(elastase inhibitor), SLPI, PI-3> and PI 9; and the acute phase 
protein orosomucoid. LPS regulation of C3AR1 and orosotnu- 
coid expression have not previously been reported. In the pro- 
teomics experiments, the down-regulation of ficolin-1 (Table BI), 
a collectin-like cell surface protein reported to activate the com- 
plement system and to mediate adhesion and phagocytosis in 
monocytes but not previously reported in granulocytes (47), may 
represent negative modulation of the innate immune response. 
The finding that genes other than cytokines and chemokines are 
regulated by the PMN in response to LPS indicates that the PMN 
plays a more sophisticated role in host-defense and immunity 
than previously thought. 

Treatment of the PMN with LPS lead to the induction of a set 
of genes associated with the anti-viral Type I interferons, 
IFNa/|3. This induction occurs independently of the release of 
IFN or another unidentified soluble factor. 2 Furthermore, the 
set of genes expressed is smaller than that induced by IFNoy/3, 
as described by Der et aL (12). This may be due to differences in 
the scope of the signaling systems activated by LPS and 
lFNtt//3, or the time course of analysis of genes in the LPS- 
stimulated PMN. The implication that LPS treatment of PMN 
allows PMN to express an ti- viral activity is currently being 
tested. Of interest was the finding that induction of interferon- 
stimulated genes was blocked by pretreatment of PMNs with 
SB203580. Work from our laboratory has indicated that signal 
transducers and activators of transcription activation does not 
occur in response to LPS in PMNs. 2 In addition, interferon- 



2 K. C. Malcolm and G. S. Worthcn, manuscript in preparation. 
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Tabik in 

Analysis ofpH 3.0-10.0 two-dimensional PAGE gels 
Mean changc(-fold) in expression level among six PMN donors ia reported. The change in expression for the proteins Hated was statistically 
significant (p < 0.05) as measured by a two-tailed Students t test. 



Identification [spot no.l 


Swias-Prot no. 


Estimated 


Theoretical 
Afa/pl 


Peptides matched/ 
submitted 


Protean 
covered 


Mean 
change 










% 


% 


■fold 


Up-regulated 








9A12(75%) 


36% 


1.51 


Proteasome ft chain [646] 


P28070 


27/5.7 


29.2/5.72 


Amiexin III 1550] 


P12429 


31/5.7 


36.4/5.6 


14/18 (78%) 


42% 


1.37 


Actin fragment [5441° 


P02570 


32/5.5 


(41.7/5.29) 


13/15 (87%) 


(34%) 


1.74 


Actin fragment 15911" 


P02570 


30/5.4 


(4L7/5.29) 


Wl8 (78%) 


(29%) 


1.60 


ff-Knblase (380] 


P06733 


4175.7 


47.2/7.01 


9/10 (90%) 


24% 


1.65 


Rab-GDP dissociation inhibitor 0 {289) 


P50395 


50/6.1 


50.7/6.11 


10/11 (91%) 


25% 


1.24 


Glutathione S-transferase P [64SI 


P09211 


23/5.5 - 


23.4/5.43 


6/8 (75%) 


41% 


1.54 


Pre-B-cell colony enhancing factor 11152] 


P43490 


53/7.0 


55.5/6.69 


12/16 (75%) 


25% 


1.29 


Down -regulated 








16/22 (73%) 


34% 


0.53 


Adenylyl cyclase-associated protein 1 [256] 


Q01518 


55/7.3 


51.7/8.07 


Rho-GAPl [283] 


Q07960 


50/5.8 


50.4/5.85 


7/9 (78%) 


22% 


0.67 


Ficolin 1 [oil] 


O006O2 


33/6.5 


35/6.39 


10/12 (83%J 


26% 


0.74 



" The theoretical pi and Af R of native actin are indicated. Protein coverage indicates coverage of native actin. 

Table IV 

Analysis of pH 5.0-6.0 two-dimensional PAGE gels 
Results are from pooled samples for control (n~3) and LPS-exposed (n = 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. "New* designates proteins seen in the LPS gel in two repeat experiments but 
not detectable in the corresponding control gels. 



Identification [spat no.) 



Up-regulated 
Protein- tyrosine kinase 9-like [468] 
Protein phosphatase 1, catalytic summit, p isoform 
[378] 

P0 4 -stathmin (577] 

Konmuscle myosin heavy chain [1102] 

Putative P0 4 -nonmnscle myosin heavy chain [110 if 

Leukocyte elastase inhibitor [316] 

Graacalcin [1004] 

Down-regulated 

Adenosylhomocysteinase [324] 

PEST phosphatase interacting protein homoleg [234K 



Swiss-Prct 
no. 


Estimated 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Change 








% 


% 


-fold 


Q9Y3F5" 


34/5.81 


39.5/6.37 


10/14 (71%) 


34% 


1.8 


P37140 


38/5.73 


37.2/5.84 


7/10 (70%) 


22% 


2.0 


P16949* 


18/5.36 


17.3/5.76 


9/12(75%) 


42% 


2.1* 


189036° 


145/5.32 


145/5.23 


20/21 (95%) 


17% 


New 


189036^ 


145/5.29 


145/5.23 


14/16 (87%) 


13% 


New 


P30740 


42/5.71 


42.7/5.9 


9/13 (69%) 


22% 


2.4 


P28676 


24/5.36 


24.0/5.02 


7/10 (70%) 


31% 


New 


P23526 


4875.82 


47.7/6.04 


7/9 (78%) 


14% 


0.4 


41Q0162' 


48/5.30 


47.6/5.35 


11/13 (85%) 


30% 


0.5 



n TrEMBL accession number. 

h Accession number and theoretical pi and M tt for the unmodified protein are indicated. 
c WCBI accession number. 
d See text for explanation. 

* Among three experiments, the ratio of P0 4 -stathmin expression increase, following LPS exposure in the presence of SB203580 divided by that 
in the absence of SB203580, was 0.93. 
f Genpept accession number. 

e This se-arch was performed using average masses measured by linear mode MALDI-TOF MS. 

Table V 

Analysis ofpH 5.5-6.7 two-dimetisional PAGE gels 
Results are from pooled samples for control (n ~ S) and LPS-exposed (n ~ 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. 



Identification [spot no .J 



Swiss-fret 



Estimated 



Theoretical 



Peptides matched/ 
submitted 



Protein 
covered 



Change 



Up-rcgulatcd 

Transnldolase 14751 . P37837 38/5.95 

leocitrate dehydrogenase 1431] 075874 46'*6.25 

Moesin (2011 P26038 61/6.09 

/r-Enolase [459] P06733 43/5.64 

Down-regulaied 

Calponin 112 (240) Q99439 34/6.G5 





% 


% 


-f,M 


S7.5/6.36 


13/17(76%) 


33% 


2.5 


46.7/6.35 


7/7(100%) 


13% 


2.3 


G7.8/6.07 


11/13(85%) 


17% 


2.1 


47.2/7.01 


7/10 (70%) 


17% 


3.8 


33.7/6.94 


10/11(90%) 


27% 


0.5 



regulatory factor 3, a known regulator of in terteron- stimulated 
gene transcription, is not a direct target of p38 kino.se. 2 There- 
fore, gene expression analysis of LPS-stiinulated PMNs has 
uncovered a previously uncharacterized eignal transduction 
system that is sensitive to inhibition of p38 MAPK. 

Knowledge of the genes down -regulated by LPS permits the 



development of further hypotheses addressing PMN function in 
the face of infection. Strikingly, several down-regulated genes 
and gene products are structural in nature {e.g. paxillin, acti- 
nin, cnlponin H2) (Tables II and V). A known consequence to 
the PMN of LPS exposure is decreased motility (48). Up-regu- 
lation of genes for adhesion molecules (/CAM- 7, CD44, AL- 
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Fig. 2. A, the predicted sequence of the tryptic phosphopeptide in 
P0 4 -atalhmin (.1468.72 Da). The peptide maas measured by MAI X)I- 
TOF MS and the predicted mass differed by 14 ppm. As indicated, two 
alternate phosphorylation Bites are possible: serine 16 and serine 25. B t 
P0 4 -stathmin and stathmin were identified on the control and LPS- 
exposed pH 5.0-6.0 gels. Consistent with phosphorylation, the P0 4 - 
stalhmin spot was distinguished by a peptide of mass 1468.72 Da (Le. 
80 Da greater than the peptide of 1388.72 Da seen in the atathmin spot). 
Assuming that no other multiply phosphorylated atathmin species haye 
escaped detection, analysis of the integrated intensities of the P0 4 - 
stathinin and stathmin apots indicates that the percentage of the P0 4 
form of total cellular stathmin has increased from 1 \% to 38% with LPS 
stimulation. The decrease in integrated intensity for stathmin was 
equal in amount to the increase in P0 4 -stathmin following LPS 
exposure. 

CAM, and TSG-6), and down-regulation of genes for structural 
proteins, indicates a genetic basis for this observation. Down- 
regulation of two genes implicated in cytoskeletal regulation, 
Pix-a and RhoB t was also observed. The calcium-binding pro- 
tein S100A4, down-regulated in LPS-treated PMNs (Table II). 
has been implicated in celt motility and metastasis (49). De- 
creased motility may be beneficial in sustaining the inflamma- 
tory response at sites of infection. In addition, LPS treatment 
results in an inhibition of apoptosis (50). Therefore, the longer 
residence time of the PMN at sites of infection is consistent 
with the long term genetically coded changes seen in these 
gene- profiling experiments and indicates that the changes in 
gene expression are functionally relevant to host defense and 
immunity. 

By providing information on post-translational modification, 
the proteomics data may provide further insights into the cy- 



Tabik VI 

Effect ofSB203580 on LPS-stimulated gene expression 
Genes are reported for which the SB203580/control expression ratio 
is < 0.60. 



Gene name 


•fiilil rnflnffti infill 

(SB203580fcantrol) 


«fSB20358i) 






-fold 


ISG15 


0.09 


22.5 


HCR 


0.38 


20.8 


Mxl 


0 


19.4 


IFI56 


0 


12.1 


P/-9 


0.57 


9.5 


EU-2 


0.59 


7.4 


IL6 


0.45 


6.3 


Ret 


0.50 


6.2 


UMS1 


0.58 


6.1 


C3AR1 


0.49 


6.1 


INBO 


0.35 


5.2 


KIAA0105 


0.41 


5.1 


SNAP23 


0.58 


6.0 


SLPI 


0.58 


4.7 


ELNAII2 


0.49 


4.G 


HM-74 


0.57 


3.8 


PKR 


0 


3.7 


MAD 


0.21 


3.6 


IFIT4 


0.12 


3.6 


Glycerol kinase 


0 


3.6 


IFI54 


0 


3.5 


IFI58 


0.39 


3.5 


1PF35 


0.46 


3.0 



Table VII 

Effect of SB203580 on LPS-stimulated protem expression 



Pi-ottin name 


-fold rhuitce rutin 
(SB20358t>Wtroi) 


Change in 
ubs en re of 
SB203580 






-fold 


Up-regulaied 






Proteasome 0 chain 


0.8 


1.51 


Annex in III 


0.6 


1.37 


Actin fragment [5441 


0.8 


1.74 


Actin fragment [5911 


0.8 


1.60 


a-Enolase 


0.6 


1.65 


Ha b- GDP dissociation inhibitor 0 


1,1 


1.24 


Glutathione S-trari£fcrase P 


1.2 


1.54 


Pre-B-cell colony enhancing factor 


1.2 


1.29 


Down-regulated 






Adenylvl cyciaee-associated protein 1 


1.3 


0.53 


Rho-GAPl 


0.8 


0.67 


Ficotin 1 


1.0 


0.74 



toskeletal remodeling effects of LPS upon the PMN. We con- 
tend that the actin fragments identified (Table III) are unlikely 
to represent technical artifacts. Rather, their specificity (iden- 
tical molecular weight/pi among different experiments), statis- 
tically significant up-regulation by LPS. as well as the use of a 
lysis buffer containing chaotropes and multiple protease inhib- 
itors orgue instead that these fragments are physiologic con- 
sequences of LPS exposure in the human PMN. More specifi- 
cally, the up-regulation of these fragments following LPS 
exposure (Table III) suggests that LPS may activate an actin- 
cleaving enzyme, which, in turn, remodels the cytoskeleton. 
Intriguing in this vein, calpain has recently been reported to 
play an important role in cell migration and cytoskeletal orga- 
nization of fibroblasts (51). The possibilities that LPS may 
induce calpain activation and that calpain activation may reg- 
ulate cytoskeletal reorganization and motility are currently 
under investigation. An alternative possibility is that actin 
cleavage is a marker of neutrophil apoptosis (52). 

Other LPS-regulated proteins may play important roles in 
cytoskeletal reorganization. The up-regulation of protein-ty- 
rosine kinase 9-like ( AG-related protein) may modulate LPS- 
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Table VIII 

LPSregulated proteins for which a probe was present on the 
Affymetrix chip 

A comparison of corresponding protein and mRNA transcript changes 
following LPS exposure is Bhown. 



iYatcin 



IVotein change 



-fold 



Up-regiilate.d 
Proteaeome 0 chain 
Leukocyte elaBiase inhibitor 
Rab-GDI p 
Grancalcm 
Transaidolase 
Moesin 

Nonmuscle myosin heavy chain 
Glutathione S-transfcrase P 
Pre-B cell enhancing factor 

Isocitrate dehydrogenase — , 

PO<-statlunin 2.1 Absent (stathmin) 

Protein phosphatase 1,0 catalytic suhunit 2 t Absent 
AnnexinUI • 31 3 1 * 



1.5 

2.4 

1,24 

New 

2.5 

2.1 

New 

1.54 

1.29 

2.3 



1.9 t 
4.6 f 

NC 

NC 

NC 

NC 

NC 
Absent 
Absent 
Absent 



Down -regulated 

Adenvlyl cyclase-associated protein 1 
Rho-GAP i 
Picolin 1 

Adenosylhoniocysteinase 
Calponin H2 



1.9 
1.5 
1.4 

2.5 
2 



211 
2.7 i 
1.7 i 
Absent 
NC 



* NC, no measureable change. 



induced uctin polymerization, because it bears a high degree of 
homology to twinfilin (A6), an actin monomer-binding protein 
that localizes to sites of rapid filament assembly in cells and is 
believed to regulate actin filament turnover (53). In turn, LPS- 
induced down-regulation of Rho-GTPase activating protein 1 
(Table III) may regulate twinfilin (and protein-tyrosine kinase 
9-like) activity, because twinfilin has been shown to colocalize 
with Racl and Cdc42 and to be regulated by active Racl in NIH 
3T3 cells (53). Activation of Rho proteins may be facilitated by 
LPS up-regulation of moesin (Table V), because moesin report- 
edly induces the dissociation of Rho from GDI (54). Racl may, 
in turn, promote activation of the actin filament-nucleating 
Arp2/3 complex through interactions with WASP (Wiskott-Al- 
drich syndrome protein) family proteins (55) and, interestingly, 
is postulated to regulate the dynamics of both the actin and 
microtubule cytoskeletons via phosphorylation-of stathmin (Ta- 
ble IV) (56). Calponin H2 ia an actin-binding protein not pre- 
viously reported in PMNs that is postulated to play a role in 
cytoskeletal organization (57). Its down-regulation by LPS (Ta- 
ble V) likely modulates LPS-induced cytoskeletal reorganiza- 
tion. The up-regulation of nonmuscle myosin heavy chain and a 
putative phosphorylated form of myosin heavy chain (putative 
protein kinase C substrate by prediction rules) in the LPS- 
exposed PMN (Table IV) is of uncertain significance; myosin 
has been implicated in multiple functions in the PMN, includ- 
ing locomotion, fluid pinocytosis, and phagocytosis (58). Of 
interest, however, S100A4 (down-regulated, Table II) has been 
reported to regulate cytoskeletal dynamics by inhibiting pro- 
tein kinase C-mediated phosphorylation of nonmuscle myosin 
heavy chain (59). 

LPS induction of stathmin phosphorylation (Table TV and 
Fig. 2) may represent another mechanism by which the cy- 
toskeleton is remodeled. Stathmin is a phosphoprotein report- 
edly involved in both signal transduction and in regulation of 
the microtubule filament network; furthermore, phosphoryla- 
tion of stathmin has been reported to modulate its tubulm- 
binding avidity (60). Inferences can be made about both the 
phosphorylation site on P0 4 -stathmm and the responsible ki- 
nase induced bv LPS. Four phosphorylation sites in stathmin 
have been well described: Ser 16 , Ser 26 , Ser* 8 , and Ser 63 (32, 33). 



Ser 16 has been reported as a substrate for Ca 2+ /calmodulin 
(CoM)-dependent kinases (32), and Ser 25 as primarily a sub- 
strate for p38 and ERK (33), with P 34 ,5dc2 also active but bear- 
ing a 5-fold preference for Ser 33 (34). As stated above, the 
phosphopeptide identified in P0 4 -stathmin, extending from 
residues 15 to 27 (1468.7 Da), is consistent with phosphoryla- 
tion of either Ser 16 or Ser 25 (Fig. 2). Although both P 388 and 
p38a MAPK isoforms are expressed in the human PMN, LPS 
has been shown to selectively activate the p38a isoform in 
human PMNs (9). The p38a isoform, however, has been shown 
to be relatively inactive at Ser 25 ; in fact, p38S is -100-fold more 
active at Ser 25 , and selective p38or inhibitors do not inhibit the 
stress-activated phosphorylation of stathmin in 293 cells (33). 
Further support for the lack of involvement of p38 signaling in 
phosphorylation of stathmin in our system is the apparent lack 
of effect of SB203580 (a selective p38« and p380 inhibitor) on 
LPS-induced expression of P0 4 -8tathinin (Table IV). Because 
p34 cdd2 i8 re i a tively inactive at Ser 25 (34), we conclude that the 
phosphorylation site is likely to be Ser 16 , a reported substrate 
of CaM-dependent kinase. Although CaM kinases have previ- 
ously been implicated in gene activation in LPS-exposed my- 
elomonocytic HD11 cells (61), stathmin signaling has not, to 
our knowledge, been previously reported in either PMNs or 
lipopolyBaccharide signal transduction. 

Cytoskeletal reorganization, a well-described regulator of 
granule release (62), may underlie LPS-induced priming for 
PMN granule release, but several LPS-regulated proteins may 
provide more specific clues. LPS exposure led to increased 
levels of grancalcin, a calcium-binding protein previously de- 
tected in PMNs and shown to translocate to granules ami 
plasma membrane in the presence of physiologic concentra- 
tions of calcium (63). Similarly, annexin HI, a calcium-binding 
protein highly expressed in PMN granule membranes and im- 
plicated in calcium-mediated secretion (64) and in granule fu- 
sion (65), was also found to be up-regulated. Exocytosis of 
granule contents may also be facilitated by LPS up-regulation 
of Rab-GDP dissociation inhibitor (Table III), which has been 
proposed to recycle Rab after vesicle fusion by extracting it 
from the membrane and loading it onto newly formed transport 
intermediates (66). 

Parallel use of DNA microarrays and proteomics affords a 
powerful strategy for comparison of corresponding mRNA tran- 
scripts and proteins, thereby affording new insight into the 
mechanisms by winch the cell regulates its signaling responses 
to the external environment. Of interest, a poor correlation was 
found between corresponding transcripts and proteins (Table 
VIE), as reported in other systems (17, 18). The finding in some 
cases of unchanged transcript abundance in the face of regu- 
lated protein levels indicates post-transcriptional modulation 
following LPS exposure. The finding of undetected transcripts 
in the face of regulated levels of the corresponding proteins 
may indi cate previous transcri ption of these genes in an earlier 
state of the myeloid maturation of the PMN, producing stable 
protein species that have undergone post-translational alter- 
ation following LPS exposure. The use of SB203580, a p38 
inhibitor, adds further insights into the mechanisms of LPS 
regulation. At the level of mRNA expression, SB203580 inhib- 
ited 23% of LPS-stimulated genes by > 40% and 11% of genes 
by >:60%; therefore, p38 plays a specific role in gene regulation 
in the PMN. In particular, proteasome (J chain was up-regu- 
lated at both the mRNA transcript and protein level (Table 
VIII), with no notable effect of SB203580 on expression at 
either level, consistent with a non-p38-mediated pathway of 
primary transcriptional up regulation induced by LPS. Simi- 
larly, CAP1, Rho-GAPl, and ficolin 1 were down-regulated at 
both the mRNA transcript and protein level (Table VUI), with 
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no notable effect of SB203580, consistent with a non~p38-me- 
diated pathway of primary transcriptional down-regulation. 
Interestingly, anriexinlll was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Table VII), 
consistent with a p38-mediated post-transcriptional'up-regula- 
tion induced by LPS. 

Limitations of the present study should be noted. Gene ex- 
pression analysis by cDNA microarrays does not distinguish 
between transcriptional regulation and mRNA stabilization; 
similarly, two-dimensional PAGE proteomics by itself does not 
distinguish among transcriptional, translational, or post-trans- 
lational regulation of protein abundance. Transcript detection 
by microarray technology is limited to the probes included; 
protein identification by two-dimensional PAGE proteomics is 
limited to well-resolved regions of the gel, may perform less 
well with hydrophobic and high molecular weight proteins, and 
tends to select for more abundant protein species (30). Harvest- 
ing of the LPS-incubated PMNs at 4 h may have prevented 
detection of earlier, transient changes and may have thereby 
introduced artifactual transcript-protein discordance. Further- 
more., the post-LPS incubation, pre-two-dimensional PAGE cell 
washes would be expected to remove secreted proteins from 
further analysis, with uncertain effects on detected protein 
abundance depending on such factors as the degree of de novo 
synthesis and extent of degranulation/exocytosis. Because pro- 
tein binding of Coomassie Blue has a limited dynamic range 
and is typically not linear throughout the range of detection, 
image analysis of Coomassie Blue-stained protein spots should 
be considered semi-quantitative. For some protein spots, the 
apparent magnitude of regulation by LPS may have been 
blunted by the spot approaching staining saturation in the 
control gel. By limiting our analysis to those protein spots 
common to all twelve pH 3.0-10.0 two-dimensional gels, we 
likely excluded some LPS-regulated proteins that happened to 
be either poorly resolved on a subset of the gels or unmatched 
by the image analysis software. By further limiting the analy- 
sis to those matched spots on the pH 3.0-10.0 gels for which a 
two- tailed t test demonstrated p < 0.05, the list of regulated 
proteins was likely also limited by statistical power. In addition 
to those regulated proteins listed in Table III, three others were 
up-regulated and three down-regulated with p < 0.09 (data not 
shown). 

Limiting our reported results to those changes that met 
statistical significance among the donors carries further impor- 
tant implications. We have encountered a two order of magni- 
tude range of response in unselected donor LPS-induced PMN 
functions, such as TNF-a and superoxide anion release (data 
not shown). The sources of this physiologic heterogeneity re- 
main uncertain but may possibly include Buch factors as nat- 
ural mutations of the LPS receptor component, TLR4 (67). By 
selecting for LPS effects common to all donors, we may not have 
characterized the range of genomic and proteomic heterogene- 
ity present in the population and thereby may have focused on 
only a narrow portion of a broader biological response to LPS. 
We contend that this reductionist approach is valid because it 
would be expected to enrich for biologically integral resjwnHes 
of the PMN to LPS. Nevertheless, correlation of genomic and 
proteomic profiles with functional phenotypes of the PMN may 
bear important diagnostic and therapeutic implications and 
will be pursued in future studies. 

Widespread regulation of numerous noncytokine/chemokine 
genes and proteins in the LPS-stimulated human PMN is a 
novel finding. These data indicate that, despite a narrow scope 
of gene expression in the none ti inula ted state, the terminally 
differentiated, short- lived PMN likely plays a role in the innate 



immune response that is far more sophisticated and dynamic 
than the simple release of preformed inflammatory mediators. 
Although gene expression appears to be an important mecha- 
nism by which PMNs respond acutely to infection, mRNA tran- 
script/protein concordance is limited, and post-transcriptional 
(and post-translational) modifications also play an important 
role. The alteration of multiple transcriptional regulators, G- 
protein regulators, P0 4 -stafchmin, and protein phosphatase 1 
indicates that one of the responses to LPS exposure is to modify 
subsequent signaling events by bacterial components or by 
other cytokines and chemokines. Finally, the finding that p38 
MAPK mediates LPS regulation of a limited subset of tran- 
scripts and proteins underlines the continuing need to define 
signal transduction cascades in the neutrophil. 
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TA3LE I 

Correlation coefficients of protein and mRNA where only one spot was present on 20 geis 
r*. correlation coeffici ent value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 



Spot 


. Unigene 


Gene name 


r* 




ns. io*»oiu 


SFN 


0.4337 


0994 


He 77D An 


ANXA4 


0.4219 


1 31 4 


ns. sUaoo 


DJ-1 


0,3982 


1454 


MS. (o^co 


SOD1 


0.3863 


1638 


nS.«f to . 


LGALS1 


0.3318 


0264 


ns. ! ZdOHO 


HNRPK 


0.3034 




U c HI OH 


FTL 


0.2849 


0963 


rts.oUU/ 1 i 


ANXA5 


0.2468 




nS.4/4;> 


PSMC 


0.2445 


nQnfi 
uyuo 


to A A an 
MS.<;a44a9 


LDHB 


0.4420 


1171 
1 1 r 1 


HS. 241515 


COX11 


0.2310 


1 1Kfl 


ns. ',81013 


PGAM1 


0.2023 


IY7CQ 


Hs.74635 


DLD 


0.1965 


1193 


Hs.83383 


AOE372 


0.1932 


0172 


Hs.3069 


HSPA9B 


0.1872 


0777 


Hs.979 


PDHB 


0.1855 


1249 


Hs.226795 


GSTP1 


0.1773 


1685 


Hs.76136 


TXN 


0.1732 


1205 


Hs.82314 


HPRT1 


0.1 58B 


1230 


Hs.279860 


TPT1 


0.1466 


0603 


Hs.181357 


LAMR1 


0.1463 


1358 


Hs.28914 


APRT 


0.1399 


1410 


Hs.82113 


DUT 


0.1213 


1825 


Hs. 11 2378 


LIMS1 


0.1213 


0871 


Hs.250502 


CA8 


0.1122 


0289 


Hs. 82916 


CCT6A 


0.1106 


1 1 43 


Hs.11465 


GSTTl_p28 


0.0997 


1 456 


Hs.1 18638 


NME1 


0.0932 


1598 


Hs.278503 


RIG 


0.0905 


1354 


Hs.89761 


ATP5D 


0.0904 


1445 


Hs.1 55485 


HIP2 


0.C843 


1479 


Hs.1 77486 


APP 


0.0746 


0608 


Hs.1 82265 


KRT19 


0.0439 


1071 


Hs. 10842 


RAN 


0.0277 


0991 


Hs,297939 


CTSB 


0.0254 


0842 


Hs.77274 


PLAU 


0.02<18 


0823 


Hs.198248 


B4GALT1 


0.0183 


0513 


Hs.1 247 


APOA4 


0.0176 


1338 


Hs.104143 


CLTA 


0.0123 


0902 


Hs.5123 


SID6-306 


0.0117 


1688 


Hs.1473 


GRP 


-0.0040 


0265 


Hs.274402 


HSPA1B 


-0.0071 


1414 


Hs.77541 


ARF5 


-0.0096 


0710 


Hs.97206 


HIP1 


-0.0114 


0532 


Hs.1 70328 


MSN 


-0.0132 


0525 


Hs.284255 . 


ALPP 


-0.0148 


0513 


Hs.76901 


PDIR 


-0.0289 


1659 


Hs.256697 


HINT 


-0.0312 


1262 


Hs.7016 


RAB7 


-0.0362 


0190 


Hs.184411 


ALB 


-0.0470 


0948 


Hs.2795 


LDHA 


-0.0549 


0502 


Hs.180532 


GPI 


-0.0575 


0152 


Hs.75410 


HSPA5 


-0.0640 


1054 


Hs.74276 


0LIC1 


-0.068'6 


0709 


Hs.253495 


SFTPD 


-0.0936 


0867 


Hs.78996 


PCNA 


-0.0982 


0165 


Hs.1 8041 4 


HSPA8 


-0.1014 


1109 


Hs.75103 


. YWHA2 


-0.1018 


0137 


Hs.554 


SSA2 


-0.1032 



Protein name 



14-3-3 a 

Annexin IV 

DJ-1 protein/MER5 

Superoxide dismutase (Cu-Zn) 

Galectin 1 

Transformation up-regulated nuclear protein 
Ferritin light chain 
Annexin V 

26 S proteasome p28 

L-lactate dehydrogenase H chain (LDH-B) 

COX 11 

Phosphoglycerate mutase 
Dihydrolipoamlde dehydrogenase precursor 
Antioxidant enzyme AOE372 
GRP75 

Pyruvate dehydrogenase El-p subunit precursor 
Glutathione S-transferase pi (GST-pi) 
Thioredoxin 

HG phosphoribosyttransf erase 
Translationally controlled tumor protein (TCTP) 
LAMR 

Adenine phosphoribosy! transferase 
dUTP pyrophosphatase (dUTPase) 
Pinch-2 protein 

Carbonic anhydrase-related protein; Syntaxin 
Chaperonin-like protein 

Glutathione S-transferase homolog (GST homolog) 
Nm23 (NDPKA) 
RUG (U32331) 

FIFO-type ATP synthase subunit d 
Huntingtin interacting protein 2 (HIP2) 
Amyloid B4A 
Cytokeratin 19 

GTP-binding nuclear protein RAN(TC4) 
Cathepsin B 

Urokinase plasminogen activator 

J3 1,4-ga'actosyi transferase 

Apolipoprotein A4 (ApoA4) 

Clathrin light chain A 

Cytosolic inorganic pyrophosphatase 

Preprogastrin-releasing peptide 

Heat shock -induced protein 

ADP-ribosylation factor 1 

Huntingtin interacting protein 1 (HIP1) 

Moesin/E 

Alkaline phosphate, placenta! 

Protein disulfide isornerase-related protein 5 

Protein kinase C inhibitor 

Rab 7 protein 

Albumin 

Lactate dehydrogenase-A (LDHA) 

Hsp89 

GRP76 

Nuclear chloride channel (RNCC protein) 
Pulmonary surfactant protein D 
PCNA 

Heat shock cognate protein, 71 kDa 
14-3-3 fc'A 
Ro/ss-A antigen 



Molecular & Cellular Proteomics 1A 305 



Protefn and mRNA Correlation in Lung Adenocarcinomas 



Jm\S: I — continued 



Spot 


Unigene 


Gene name 


r* 


Protein name 


0278 


Hs.4112 


TCP1 


-0.1237 


T-complex protein 1, a subunit 


1769 


Hs.9614 


NPM1 


- 0.173B 


B23/numatrin 


0089 


Hs.74335 


HSPCB 


-0.2049 


Hsp90 


2511 


Hs.153179 


FABP5 


-0.2109 


E-FABP/FABP5 


1739 


Hs.16488 


CALR 


-0.2344 


Calreticulin 32 


1138 


Hs.301961 


GSTM4 


-0.2438 


Glutathione S-transferase M4 (GST m4) 


2533 


Hs.77060 


PSMB6 


-0.2512 


Macropain subunit A 



non-neoplastic lung tissues. Protein levels were determined 
using quantitative 2D-PAGE analysis, and the separated pro- 
tein polypeptides were identified using matrix-assisted laser 
desorption/ionization mass spectrometry (MALDI-MS). The 
corresponding mRNA levels for the identified proteins within 
the same samples were determined using oligonucleotide 
microarrays. Correlation analyses showed that protein abun- 
dance is likely a reflection of the transcription for a subset of 
proteins, but translation and post-translational modifications 
also appear to influence the expression levels of many indi- 
vidual proteins in lung adenocarcinomas. 

EXPERIMENTAL PROCEDURES 

Tissues— Fifty-seven stage I and 19 stage III lung adenocarcino- 
mas, as well as. nine non-neoplastic lung tissue samples, were used 
for protein and mRNA analyses. Patient consent was obtained, and 
the project was approved by the Institutional Review Board. All tis- 
sues were obtained after resection at the University of Michigan 
Health System between May 1991 and July 1998. Tissues were all 
snap-frozen in liquid nitrogen and then stored at -80 °C. The patients 
included 46 females and 30 males ranging in age from 40.9 to 84.6 
(average 63.8) years. Most patients (66/76) demonstrated a positive 
smoking history. Sixty-one tumor samples were classified as bron- 
chial-derived, 14 were classified as bronchoalveolar, and one had 
both features. Eighteen tumor samples were classified as welt differ- 
entiated, 38 were classified as moderate, and 19 were classified as 
poorly differentiated adenocarcinomas. Hematoxylin-stained cryostat 
sections (5 ^m), prepared from the same tumor pieces to be utilized 
for protein and mRNA isolation, were evaluated by a pathologist and 
compared with hematoxylin- and eosin-stained sections made from 
paraffin blocks of the same tumors. Specimens were excluded from 
analysis if they showed unclear or mixed histology (e.g. adenosqua- 
mous), tumor celkdnrity less than 70%, potential metastatic origin as 
indicated by previous tumor history, extensive lymphocytic infiltration, 
or fibrosis or if the patient had received prior chemotherapy or 
radiotherapy. 

Oligonucleotide Array Hybridization- The HuGeneFL oligonucleo- 
tide arrays (Affymetrix, Santa Clara, CA) containing 6800 genes were 
used in this study. Total RNA was isolated from all samples using 
Trizol reagent (Invitrogen). The resulting RNA was then subjected to 
further purification using RNeasy spin columns (Oiagen). Preparation 
of cRNA, hybridization, and scanning of the HuGeneFL arrays were 
performed according to the manufacturer's protocol (Affymetrix, 
Santa Ctara, CA). Data analysis was performed using GeneChip 4.0 
software. The gene expression profile of each tumor was normalized 
to the median gene expression profile for the entire sample. Details of 
data trimming and normalization are described elsewhere (1 1). 

2D-PAGE and Quantitative Protein Analysis— Tissue for both pro- 
tein and mRNA isolation came from contiguous areas of each sample. 
Protein separation using 2D-PAGE, silver staining, and digitization 



were performed as described previously (12, 13). Our 2D-PAGE sys- 
tem allows us to run 20 gels at one time (one batch). Spot detection 
and quantification were accomplished utilizing Bio Image Visage Sys- 
tem software (Bioimage Corp*. Ann Arbor, Ml). The integrated inten- 
sity of each spot was calculated as the measured optical density 
units x mm 2 . Of the total possible 2000 spots detectable on each gel, 
820 spots on the gel of each sample were matched using a Gel-ed 
match program with the same spots on a chosen "master" gel. In 
each sample, 250 ubiquitously expressed reference spots were used 
to adjust for variations between gels, such as that created by subtle 
differences in protein loading or gel staining. Slight differences be- 
cause of batch were corrected after spot-size quantification. 

Mass Spectrometry and 2D Western Blotting— Preparative 2D gels 
were run using extracts from A549 lung adenocarcinoma cells (ob- 
tained from ATCC) and using the identical experimental conditions as 
the analytical 2D gels, except 30% more protein was loaded. The 
resolved protein gels were silver-stained using successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1 % silver nitrate for 40 
min, and 0.01 4% formaldehyde plus 2% sodium carbonate for 10 
min. For protein identification, protein polypeptides underwent trypsin 
digestion followed by MALDI-MS using a MALD1-TOF Voyager-DE 
mass spectrometer (Perseptive Biosystems, Framingham, MA). The 
masses were compared with known trypsin digest databases using 
the MS-FIT database (University of California, San Francisco; 
prospector.ucsf.edu/ucsfhtml3.2/msfit.htm). Some of the polypep- 
tides included in the analysis had been identified prior to this study on 
the basis of sequencing (14). The identified protein spots used in this 
paper are shown in Rg. 1A The method for 2D-PAGE Western blot 
verification was as described previously (15). The 2D Western blots of 
GRP58 and Op18 are shown in Fig. 1, C and 6; the others, such as 
GRP78, GRP75, HSP70, HSC70, KRT8, KRT18, KPT 19, Vimentin, 
ApoJ, 14-3-3, Annexin I, Annexin II, PGP9.5, DJ-1, GST-pi, and 
PGAM. are described elsewhere. 2 

Statistical Analysis- Missing values were replaced with the mean 
value of the protein spot. The transform x — » log (1 + x) was applied 
to normalize all protein expression values. The relationship between 
protein and mRNA expression levels within the same samples, was 
examined using the Spearman correlation coefficient analysis (16). To 
identify potentially significant correlations between gene and protein 
expression, we used an analytical strategy similar to SAM (signifi- 
cance analysis of microarrays) (1 7), which uses a permutation tech- 
nique to determine the significance of changes in gene expression 
between different biological states. To obtain permuted correlation 
coefficients between gene and protein expression, genes were ex- 
changed first !n such a way that permutated correlation coefficient 
were calculated based on pseudo pairs of genes and proteins. The 
distribution of permutated correlation coefficients became stable after 
60 permutations. This procedure was then repeated 60 times to 
obtain 60 sets of permutated correlation coefficients. For each of the 
60 permutations, the correlations of genes and proteins were ranked 



2 Chen e* al. , submitted for publication. 
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Table II 

Correlation coefficients of protein and mRNA where multiple isoforms were present on 2D gels 
r*. correlation coefficient value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 
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Table II — continued 

Correlation coefficients of protein and mRNA where multiple isoforms were present on 2D gafs 
r*. correlation coefficient value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 



Spot Unigene Gene name r* Protein name 
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protein spots on 2D gels representing 98 genes and com- 
pared protein levels with mRNA levels for a cohort of 85 lung 
adenocarcinomas and normal lung samples. Of the 165 pro- 
tein spots, 69 proteins were represented by only one known 
spot on 2D gels for an individual gene, whereas 96 protein 
spots showed multiple protein products from 29 different 
genes. 2D Western blotting verified the proteins identified by 
mass spectrometry when specific antibodies were available. 
Spearman correlation coefficients of the proteins and their 
associated mRNA for each protein spot were generated using 
all 76 lung adenocarcinomas and nine non- neoplastic lung 
tissues (see Tables I and II, and see Figs. 1 and 2). The 
correlation coefficients (r) ranged from - 0.467 to 0.442 (Fig. 
2D). A total of 28 protein spots (21 genes) were found to have 
a statistically significant correlation between expression of 



such that p p if) denotes the rth largest correlation coefficient for pth 
permutation. Hence, the expected correlation coefficient, p t §, was the 
average over the 60 permutations, p £ (f) - X^... , p p (/y60. A scatter plot of 
observed correlations (ptf)) versus the expected correlations is shown in 
Fig. 20. For this study, we chose threshold A — 0.1 15 so that correlation 
would be considered significant if absolute value of difference between 
P0) and p^f) was greater than the threshold. Twenty- nine (including one 
with observed correlation coefficient -0.4672) of 1 65 pairs of gene and 
protein expression were called significant in such* criteria, and the 
permuted data generated an average of 5.1 falsely significant pairs of 
gene and protein expression. This provided an estimated false dis- 
covery rate (the percentage of pairs of gene and protein expression 
identified by chance) for our data set. 

RESULTS 

Correlation of Individual Proteins and mRNA Expression 
within Each Tumor— We have examined quantitatively 165 
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Rg 1 A digital image of a silver-stained 2D-PAGE separation of a stage i lung adenocarcinoma showing protein spots separated by 
molecular mass {MW) and isoelectric point (Pi). Twenty-eight protein spots whose expression levels are correlated with mRNA abundance are 
indicated by the black arrows. B, the outlined areas of A showing protein GRP58. C, 2D Western blot of GRP58 from the A549 lung 
adenocarcinoma cell line. D, the outlined areas of A showing the protein isoforms of Op18. E, 2D Western blot of Op18 from A549 cells. 



their protein and mRNA (r > 0.2445; p < 0.05). This accounts 
for 17% (28/165) of the 165 protein spots. Among the 69 
genes for which only a single protein spot was known (Table 
l) t nine genes (9/69, 13%) were observed to show a statisti- 
cally significant relationship between protein and mRNA 
abundance (r > 0.2445; p < 0,05). The proteins whose ex- 
pression levels were correlated with their mRNA abundance 
included those involved in signal transduction, carbohydrate 
metabolism, apoptosis, protein post-translational modifica- 
tion, structural proteins, and heat shock proteins (Table III). 

Individual Isoforms of the Same Protein Have Different 
Protein/mRNA Correlation Coefficients-Oi the 165 protein 
spots, 96 represent protein products of 29 genes with at least 
two isoforms. Among these 96 protein spots, 19 (19/96 pro- 
tein spots, 20%) showed a statistically significant correlation 
between their protein and mRNA expression (r > 0.2445; p < 
0.05) (Table II) and represented 12 genes (12/29, 41 %). Individ- 
ual isoforms of the same protein demonstrated different 
protein/mRNA correlation coefficients. For example, 2D -PAGE/ 
Western analysis revealed four isoforms of OP 18 differing in 
regards to isoelectric point but similar in molecular weight. 
Three of the four isoforms (spots 1492, 1493, and 1494) showed 
a statistically significant correlation between their protein and 
mRNA abundance (r - 0.3234, 0.3154, and 0.4003, respective- 
ly). The forth isoform (spot 1488) showed no correlation be- 



tween protein and mRNA expression (r = 0.0495). Similarly, just 
one of five quantified isoforms of cytokeratin 8 (spot 439) dem- 
onstrated a statistically significant correlation between protein 
and mRNA abundance (r = 0.3049; p < 0.05) (Table II). 

In addition to differences in the relationship between mRNA 
levels and protein expression among separate isoforms, some 
genes with very comparable mRNA levels showed a 24-fo!d 
difference in their protein expression. Genes with comparable 
protein expression levels also showed up to a 28-fold vari- 
ance in their mRNA levels. 

Lack of Correlation for mRNA and Protein Expression when 
Using Average Tumor Values across All 1 65 Protein Spots (98 
Genes)— Trie relationship between mRNA and protein expres- 
sion was also examined by using the average expression 
values for all samples. To analyze this relationship using this 
approach, the average value for each protein or mRNA was 
generated using all 85 lung tissue samples. The range of 
normalized average protein values ranged from -0.0646 to 
0.0979 (raw value 0.0036 to 4.1947), and the range for mRNA 
was from 0 to 15260.5 for all 165 individual protein spots. The 
Spearman correlation coefficient for the whole data set (165 
protein spots/98 genes) was -0.025 (Fig. 3A). Even for the 28 
protein spots (Fig. 2D) that were found to have a statistically 
significant correlation between their mRNA and protein, use of 
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the average value resulted in a correlation coefficient value of 
-0.035, which was not significant (Fig. 3B). 

Lack of a Relationship between Protein/mRNA Correlation 
Coefficients and Average Protein Abundance-Jo determine 
whether an absolute protein level might influence the corre- 
lation with mRNA, the mean value of each protein (relative 
abundance) and the Spearman protein/mRNA correlation co- 
efficients among all 85 samples were examined. No relation- 
ship between the protein abundance and the correlation co- 
efficients was obseived (r - 0.039; p > 0.05). A detailed 
analysis of separate subsets of proteins with differing levels of 
abundance (less than -0.0014, targer than -0.0014, or larger 
than 0.0077) also showed a lack of correlation between mRNA 
and protein expression among the 83 (50%), 82 (50%), and 41 
(25%) of 165 total protein spots, respectively (r = 0.016, 0.08, 
and 0.172, respectively). 

Stage-related Changes in the Protein/mRNA Correlation 
Coefficients- To determine whether the 21 genes (28 protein 
spots) showing a significant correlation between the protein 
and mRNA expression among all samples demonstrate 
changes in this relationship during tumor progression, the 
correlations were examined separately for stage I (n = 57) and 



stage III (n - 19) lung adenocarcinomas (Table III). The num- 
ber of non-neoplastic lung samples (n = 9) was insufficient for 
a separate correlation analysis of this group. Many of the 
protein spots represent one of several known protein isoforms 
for a given gene. The majority of genes (1 6/21 ) did not differ in 
the protein/mRNA correlation between stage I and stage III 
tumors indicating a similar regulatory relationship between the 
mRNA and protein spot. GRP-58, PSMC. SOD1, TPI1, and 
VIM, however, were found to demonstrate significant differ- 
ences in the correlation coefficients between stage I and 
stage HI lung adenocarcinomas. For GRP-58, PSMC, and VIM 
the change in the correlation coefficient was because of a 
relative increase in protein expression in stage III tumors. For 
SOD and TPI the change resulted from a relative decrease in 
expression of this specific protein in stage ill tumors. 

DISCUSSION 

Relatively little is known about the regulatory mechanisms 
controlling the complex patterns of protein abundance and 
post-translational modification in tumors. Most reports con- 
cerning the regulation of protein translation have focused on 
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Table III 

Stage-dependent analysis of protein-mRNA correlation coefficients 
in boldface indicate a significant difference between stage I and stage III. 
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Function 



Carbohydrate metabolism; electron transporter 
Phospholipase inhibitor; signal transduction 
Phospholipase inhibitor 

Phospholipase Inhibitor; calcium binding; phospholipid binding 
Signal transduction 
Iron storage protein 

Carbohydrate metabolism (glycolysis regulation) 
Signal transduction; protein disulfide isomerase 
RNA-binding protein (RNA processing/modification) 
Heat shock protein 
Structural protein 
Structural protein 

Signal transduction; celt growth and maintenance 
Apoptosis; cell adhesion; cell size control 
Protein degradation 

Signal transduction (protein kinase C inhibitor) 

Oxidorecuctase 

Carbohydrate metabolism 

Structural protein (muscle); control of heart 

Structural protein 

Signal transduction 



one or several protein products (18). Celis er a/. (19) found a 
good correlation between transcript and protein levels among 
40 well resolved, abundant proteins using a proteomic and 
microarray study of bladder cancer. By comparing the mRNA 
and protein expression levels within the same tumor samples, 
we found that 1 7% (28/1 65) of the protein spots (21/98 genes) 
show a statistically significant correlation between mRNA and 
protein. These proteins appear to represent a diverse group of 
gene products and include those involved in signal transduc- 
tion, carbohydrate metabolism, protein modification, ceil struc- 
ture, heat shock, and apoptosis. These results suggest that 
expression of this subset of 1 65 proteins is likely to be regulated 
at the transcriptional level in these tissues. The majority of the 
protein isoforms, however, did not correlate with mRNA levels, 
and thus their expression is regulated by other mechanisms. We 
also observed a subset of proteins that demonstrated a nega- 
tive correlation with the mRNA expression values; for example 
«-haptoglobin demonstrated a strong negative correlation with 
its mRNA expression values. This may reflect negative feedback 
on the mRNA or the protein or the presence of other regulatory 
influences that are not understood currently. 

Post-translational modification or processing will result in 
individual protein products of the same gene migrating to 
different locations on 2D- PAGE gels (20). Because the identity 
of all possible isoforms for each protein examined has not 
been characterized completely, this may influence the corre- 
lation analyses performed in this study. This is partly because 
of limitations of the 2D-PAGE and mass spectrometry tech- 
nologies (21, 22). Potential inconsistencies between mRNA 
and protein correlations that have been reported may also be 
because of differences, even in the same gene, in the mech- 



anisms of protein translation among different cells or as 
measured in different laboratories (23). 

In this study, we examined 165 protein spots identified in 
lung adenocarcinomas. Ninety-six protein spots, representing 
the products of 29 genes, contained at least two protein 
isoforms. Nineteen of 96 protein spots, representing 12 
genes, were shown to have a statistically significant correla- 
tion between their protein and mRNA expression, suggesting 
that the levels of these proteins reflects the transcription of the 
corresponding genes. Differences in proteirVmRNA correlations 
were found among the individual isoforms of a given protein. For 
example, of the four OP1 8 isoforms, three showed a statistically 
significant correlation between the protein and mRNA expres- 
sion levels. The lack of relationship for the one isoform, how- 
ever, indicates that individual protein isoforms of the same gene 
product can be regulated differentially. This is not unexpected 
and likely reflects other post-translational mechanisms that can 
influence isoform abundance in tissues and cancer. 

In addition to the analyses of the correlation of mRNA/ 
protein within the same tumor samples, we also tested the 
global relationship between mRNA and the corresponding 
protein abundance across all 165 protein spots in the lung 
samples. A protein and mRNA average value for each gene 
was generated using ail 85 lung tissues samples. We ob- 
served a very wide range of normalized average protein and 
mRNA values. The correlation coefficient generated using this 
average value data set was -0.025, and even for the 28 
protein spots that showed a statistically significant correlation 
between individual mRNA and proteins, the correlation value 
was only -0.035. This suggests that it is not possible to 
predict overall protein expression levels based on average 
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Rg. 3. The overall correlation of 
mRNA and protein levels across atl 
165 protein spots (A) and across 28 
protein spots that contained Individ- 
ual r values larger than 0.244 (B) are 
shown. Each protein or mRNA mean 
value was calculated based on all 76 
lung adenocarcinomas and nine non- 
neoplastic lung samples using quantita- 
tive 2D -PAGE and Affymetrix oligonu- 
cleotide microarrays. The Spearman 
correlation coefficients for the two data 
sets (A and 8) were -0.025 and -0.035, 
respectively, indicating a lack of correla- 
tion if mean values for mRNA and protein 
for alt samples is used. 
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mRNA abundance in lung cancer samples. This conclusion is 
also supported by previous results from Anderson and Seil- 
hamer (24), who examined 19 genes in human liver cells, and 
by Gygi et a/. (25), who examined 106 genes in yeast. Both 
studies found a lack of correlation between mRNA and protein 
expression when average or overall levels were used. 

A good correlation was reported when the 11 most abun- 
dant proteins were examined in yeast (25), suggesting that the 
level of protein abundance may be a factor that may influence 
the correlation between mRNA and protein. In the present 
study, a fairly wide range of mean protein values among 165 
protein spots in lung adenocarcinomas was observed, and 
the correlation coefficients also varied from -0.467 to 0.442. 



A comparison between the mean value of each protein and 
the correlation coefficient generated using all 85 tissue sam- 
ples did not reveal a strong relationship between the overall 
protein abundance and the correlation coefficients (r = 0.039; 
p > 0.05). Detailed analysis of different subsets of protein abun- 
dance also failed to show a correlation between mRNA and 
protein expression. Thus in contrast to yeast, a relationship 
between mRNA/protein correlation coefficient and protein 
abundance in human lung adenocarcinomas was not observed. 

The results of this study indicate that the level of protein 
abundance in lung adenocarcinomas is associated with the 
corresponding levels of mRNA in 17% (28 proteins) of the 
total 165 protein spots examined. This was substantially 
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higher than the amount predicted to result by chance alone 
(which was 5.1) and suggests that a transcriptional mecha- 
nism likely underlies the abundance of these proteins in lung 
adenocarcinomas. We also demonstrate that the expression 
of individual isoforms of the same protein may or may not 
correlate with the mRNA, indicating that separate and likely 
post-translational mechanisms account for the regulation of 
isoform abundance. These mechanisms may also account for 
the differences in the correlation coefficients observed between 
stage I and stage HI tumors, indicating that specific protein 
isoforms show regulatory changes during tumor progression. 
Further studies in lung adenocarcinomas will examine the rela- 
tionship between the expression of individual protein isoforms 
and specific clinical-pathological features of these tumors, such 
as the presence of angioiymphatic invasion, and nodal or pleu- 
ral surface involvement. The potential to identify specific protein 
isoforms associated with biological behavior in lung adenocar- 
cinomas would be of considerable interest and will add to our 
understanding of the regulation of gene products by transcrip- 
tional, translations, and post-translational mechanisms. 

Acknowledgments-^ thank Kerby A. Shedden, Rork D. Kuick, 
Eric Puravs, Robert Hinderer, Melissa C. Krause, and Christopher 
Wood for assistance in this study. 

* This work was supported by NCI, National Institutes of Health 
Grant U19 CA-85953. The costs of publication of this article were 
defrayed in part by the payment of page charges. This article must 
therefore be hereby marked "advertisement" in accordance with 18 
U.S.C. Section 1734 solely to indicate this fact. 

To whom correspondence should be addressed; General Tho- 
racic Surgery, SRB II, B560, Box 0686, University of Michigan Medical 
School, Ann Arbor t Ml 48109-086; E-mail: dcbeer@urnich,edu. 

REFERENCES 

1 Ideker T Thorsson, V., Ranlsh, J. A., Christmas, R., Buhler, J., Eng, J, K., 
Bumgarner, R„ Goodiett, D. a, Aebersold, R, Hood, L (2001) integrated 
genomic and proteomic analyses of a systematically perturbed meta- 
bolic network. Science 292, 929-934 

2. Liang, P., Pardee, A. B. (1998) Differential display. A general protocol. Mo/. 

Biotechnol. 10, 261-267 

3. Porter, O. A., Krop, I. E., Nasser, S., Sgroi.'D., Kaelin, C. M., Marks, J. R.. 

Rjggins, G., Polyak, K. (2001) A sage (serial analysis of gene expression) 
view of breast tumor progression. Cancer Res. 61, 5697-5702 

4. Bittner, M., Mettzer. P., Chen, Y., Jiang, Y., Seftor, E., Hendrix, M. ; 

Radrnacher. M., Simon, R.. Yakhini t JL, Ben-Dor, A. t Sampas, N-, 
Dougherty, E., Wang, E„ Marincola, F., Gooden, C, Lueders, J., Glat- 
tettRr A., Pollock. P., Carpten, J., Gilianders, E„ Leia, D. ( Dietrich, K., 
Beaudry, C, Berens, M. f Alberts, D. ( Sondak, V. (2000) Molecular 
classification of cutaneous malignant melanoma by gene expression 
profiling. Nature 406, 536-540 

5. Fung, E T., Wright, G. L., Jr., Dalmasso, E. A. (2000) Proteomic strategies 

for biomarker identification: progress and challenges. Curr. Op/n. Mol. 
Ther. 2. 643-650 

6. Davidson, D., BaJdock, R. (2001) Bioiniormatics beyond sequence: map- 

ping gene function in the embryo. Nat. Rev. Genet 2, 409-417 
7 Chee M Vane. R. t Hubbeli, E., Bemo, A., Huang, X. G., Stern, D. t Winkler. 
J.. Lockhart D. J., Morris, M. S., Fodcr, S. P. (1996) Accessing genetic 
information with high-density DNA arrays. Science 274, 610-614 



8. Wane, D. G., Fan, J. B., Siao, C. J., Berno, A., Young, P., Sapoisky. R., 

Ghandour, a, Perkins, N., Winchester, E,, Spencer, J., Krugiyak, L, 
Stein. L, Hsie, L, Topaioglou, T., Hubbeli, E M Robinson, E., Mittmann, 
M., Morris, M. S.. Shen, N., Kilburn, D., Rioux, J., Nusbaum, C> Rozen. 
S.,' Hudson, T. J., Under, E S. (199B) Large-scale identification, map- 
ping, and genotyping of single-nucleoiide polymorphisms in the human 
genome. Science 280, 1077-1082 

9. Pease, A. C, Solas, D., Suilivan, E J., Cronin, M. T M Holmes, C. P., Fodor, 

s P. (1994) Light-generated oligonucleotide arrays for rapid DNA se- 
quence analysis. Proc. Nat!. Acad. Sci. U. S. A 91. 5022-5026 
10 Bhattacharjee, A, Richards. W. G., Staunton, J., Li, C, Monti, S., Vasa, P., 
Ladd. C., Beheshtj, J., Bueno, R., Gillette, M„ Loda, M.. Weber, G., Mark. 
E. J., Lander, E. S., Wong, W„ Johnson, B. E, Goiub, T. a, Sugarbaker, 
D- j!, Meyerson. M. (2001) Classification of human lung carcinomas by 
mRNA expression profiling reveals distinct adenocarcinoma subclasses. 
Proc. Natl. Acad. Sci. U. S. A 98, 13790-13795 

1 1. Giordano, T. J., Shedden, K. A., Schwartz, D. R-, Kuick, R., Taylor, J. M. G., 

Lee N., Misek. D. E„ Greenson, J. K., Kardia, S. L. R., Beer, D. G., 
Rennert, G., Cho, K. R. s Gruber, S. B., Fea^on, E R., Hanash, S. (2001) 
Organ-specific molecuiar classification of lung, colon and ovarian adeno- 
carcinomas using gene expression profiles. Am. J. Pathol. 159, 1231-1238 

12. Strahier, J. R., Kuick, R., Hanash, S. M. (1989) in Protein Structure: A 

Practical Approach (Creighton, T., ed) pp. 65-92, IRL Press, Oxford 

13. Merril, C. R., Dunau, M. L, Goldman, D. (1981) A rapid sensitive silver Stan 

for poiyoeptides in polyacryl amide geis. Anal. Bioch&m. 101, 201-207 

1 4. Hanash, S* M., Strahier, J. R., Chan, Y. ( Kuick, R.. Teichroew, D., Neel, J. V., 

Hailat, N., Keim, D. R., Gratioi-Deans, J., Uncar, D., Richardson, B. C. 
(1993) Data base analysis of protein expression patterns during T~ceH 
ontogeny and activation. Proc. Nat!. Acad. Sci. U. S. A 90, 3314-3318 
15 Brichory, F. M., Misek, D. E, Yim, A M„ Krause, M. C, Giordano, T. J., 
Beer, D, G.. Hanash, S. M. (2001) An immune response manifested by 
the common occurrence of annexins I and Ij autoantibodies and high 
circulating levels of IL-6 in lung cancer. Proc. Natt. Acad Sci. U S. A. 98, 
9824-9629 

16. Lavens-Phillips, S. E. t MacGlashan, D. W., Jr. (2000) The tyrosine Kinases 

p53/56iyn and p72syk are differentially expressed at the protein level but 
not at the messenger RNA level in nonreleastng human basophils. Am. J. 
Respir. Cell Mot. Biol. 23, 566- 571 

17. Tusher, V. G., Tibshirani, R, Chu, G. (2001) Significance analysis of mi- 

croarrays applied to the ionizing radiation response. Proc. Natl. Acad. 
Sci. U, S. A 68, 5116-5121 

18 Tew *K D Monks, A, Barone, L.. Rosser, D., Akerman, G. t Montali, J. A, 

Wheatte/, J. B., Schmidt, D. E., Jr. (1996) Glutathione-associated en- 
zymes in the human ceil lines of the National Cancer Institute Drug 
Screeninc Program. Mol. Pharmacol. 50, 149-159 

19 Celis, J. E.* Kruhorfer, M., Grcmova, I., Frederiksen, C, Ostergaard, M., 

Thykjaer,' T., Gromov, P., Yu, J., Palsdottir, H M Magnusson, N., Orntoft, 
T. F. (2000) Gene expression profiling: monitoring transcription and 
translation products using DNA microarrays and proteomics. FEBS Left. 
480, 2-16 

20. Anderson, N. L, Anderson, N. G. (1998) Proteome and proteoses: new 

technologies, new concepts, and i>ew words. Electrophoresis 19, 
1853-1861 

21. Gygi, S. P., Corthate, G. L, Zhang, Y., Rochorv Y-, Aebersold, R. (2000) 

Evaluation of two-dimensional get eiectrophoresis-based proteome anal- 
ysis technology. Proc. Naff. Acad. Sci. U S. A 97, 9390-9395 

22. Fey, S. J. t Larsen. P. M. (2001) 2D or not 2D. Two- dimensional get elec- 

trophoresis. Curr. Opin. Chem. Biol. 5, 26-33 

23. McBride, S., Walsh, D., Meleady, P.. Da!y, N., Clynes, M. (1999) Bromode- 

oxyuridine induces keratin protein synthesis at a posttranscriptional level 
in human lung tumor ceil lines. Differentiation 64. 185-193 

24. Anderson, L. Seilhamer, J. (1997) A comparison of selected mRNA and 

protein abundances in human iiver. Electrophoresis 16, 533-537 

25. Gygi. S. P., Rochon, Y., Franza, B. R., Aebersold, R. (1999) Correlation 

betweer> protein and mRNA abundance in yeast. Mol. Cell. Biol. 19, 
1720-1730 



Molecular & Cellular Proteomics 1.4 31 3 



Research 



Discordant Protein and mRN A Expression in 
Lung Adenocarcinomas* 

Guoan ChenJ, Tarek G. Gharibt, Chiang-Ching Huang§, Jeremy M. G. Taylor§, 
David E. MisekU Sharon L. R. Kardia||, Thomas J. Giordano**, Mark D. lannettonit 
Mark B. Orringerfc Samir M. HanashU, and David G. Beerf: # 



The relationship between gene expression measured at 
the mRNA level and the corresponding protein level Is not 
well characterized in human cancer. In this study, we 
compared mRNA and protein expression for a cohort of 
genes In the same lung adenocarcinomas. The abun- 
dance of 165 protein spots representing 98 individual 
genes was analyzed in 76 lung adenocarcinomas and nine 
non-neoplastic lung tissues using two-dimensional poly- 
acrylamide gel electrophoresis. Specific polypeptides 
were identified using matrix-assisted laser desorption/ 
ionization mass spectrometry. For the same 85 samples, 
mRNA levels were determined using oligonucleotide mi- 
croarrays, allowing a comparative analysis of mRNA and 
protein expression among the 165 protein spots. Twenty- 
eight of the 165 protein spots (17%) or 21 of 98 genes 
(21.4%) had a statistically significant correlation between 
protein and mRNA expression (r > 0.2445; p < 0.05); 
however, among all 165 proteins the correlation coeffi- 
cient values (r) ranged from -0.467 to 0.442. Correlation 
coefficient values were not related to protein abundance. 
Further, no significant correlation between mRNA and 
protein expression was found (r = -0.025) if the average 
levels of mRNA or protein among all samples were applied 
across the 165 protein spots (98 genes). The mRNA/ 
protein correlation coefficient also varied among pro- 
teins with multiple isoforms, indicating potentially sep- 
arate fsoform-specific mechanisms for the regulation of 
protein abundance. Among the 21 genes with a signifi- 
cant correlation between mRNA and protein, five genes 
differed significantly between stage I and stage III lung 
adenocarcinomas. Using a quantitative analysis of mRNA 
and protein expression within the same lung adenocarci- 
nomas, we showed that only a subset of the proteins 
exhibited a significant correlation with mRNA abundance. 
Molecular & Cellular Proteomlcs 1:304-313, 2002. 



Lung cancer is the leading cause of cancer death for both 
men and women in the United States. Adenocarcinomas of 
the lung comprise -40% of all new cases of non-smaii ceil 
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lung cancer and are now the most common histologic type. 
Functional genomics, broadiy defined as the comprehensive 
analysis of genes and their products, have become a recent 
focus of the life sciences (1 ). Application of these approaches to 
lung adenocarcinomas has the potential to aid in the identifica- 
tion of high risk patients with resectable early stage lung cancer 
that may benefit from adjuvant therapy, as well as to identify 
new therapeutic targets. In human lung cancer, however, little is 
currently understood regarding the relationship between gene 
expression as determined by measuring mRNA levels and the 
corresponding abundance of the protein products. 

A number of powerful techniques for analysis of gene ex- 
pression have been used including differential display (2), 
serial analysis of gene expression (3), DNA microarrays (4), 
and proteomics via two-dimensional polyacrytamide gel elec- 
trophoresis and mass spectrometry (5). Bioinformatics tools 
have also been developed to help determine quantitative 
mRN A/protein expression profiles of all types of cells and 
tissues (6) and now can be applied to benign and malignant 
tumors. DNA microarrays (cDNA and oligonucleotide) permit 
the parallel assessment of thousands of genes and have been 
utilized in gene expression monitoring (7), polymorphism anal- 
ysis (8), and DNA sequencing (9). Recent studies have fo- 
cused on classification or identification of subgroups of lung 
tumors using DNA microarrays (10, 11). The use of mRNA 
expression patterns by themselves, however, is insufficient for 
understanding the expression of protein products, as addi- 
tional post- transcriptional mechanisms, including protein 
translation, post-translational modification, and degradation, 
may influence the level of a protein present in a given cell or 
tissue. Proteomic analyses, a complementary technology to 
DNA microarrays for monitoring gene expression, involves 
protein separation and quantitative assessment of protein 
spots using 2D 1 -PAGE and protein identification using mass 
spectrometry; By combining proteomic and transcriptional 
analyses of the same samples, however, it may be possible to 
understand the complex mechanisms influencing protein ex- 
pression in human cancer. 

In this study, we determined mRNA and protein levels for 
165 proteins (98 genes) in 76 lung adenocarcinomas and nine 



1 The abbreviations used are: 2D, two-dimensiona!; MALDI-MS, 
matrix-assisted laser desorplion/ioni2ation mass spectrometry. 
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An increased high-mobility group A2 expression level is associated with 
malignant phcaotype in pancreatic exocrine tissue. 

Abe N , Watanabe T , Suzuld Y t Mats unto to N , Masaki T . Mori T . Sugiyama M , 
Chiappetta G , Fusco A , Atomi Y . 

First Department of Surgery, Kyorin University School of Medicine, 6-20-2, Shinkawa, 
Mitaka, Tokyo 181-8.611, Japan, abetibtg@kyoriu-u.ac.jp 

The altered form of the high-mobility group A2 (HMGA2) gene is somehow related to 
the generation of human benign and malignant tumours of mesenchymal origin. 
However, only a few data on the expression of HMGA2 in malignant tumour originating 
from epithelial tissue are available. In this study, we examined the HMGA2 expression 
level in pancreatic carcinoma, and investigated whether alterations in the HMGA2 
expression level are associated with a malignant phenotype in pancreatic tissue. High- 
mobility group A2 mRNA'and protein expression was determined in eight surgically 
resected specimens of non-neoplastic tissue (six specimens of normal pancreatic tissue 
and two of chronic pancreatitis tissue) and 27 pancreatic carcinomas by highly sensitive 
reverse transcriptase-polymerase chain reaction (RT-PCR) techniques and 
immunohistochemical staining, respectively. Reverse transcriptase-polymerase chain 
reaction analysis revealed the expression of rite HMGA2 gene in non-neoplastic 
pancreatic tissue, although its expression level was significantly lower than that in 
carcinoma. Immunohistochemical analysis indicated that the presence of the HMGA2 
gene in non-neoplastic pancreatic tissue observed in RT-PCR reflects its abundant 
expression in islet cells, together with its focal expression in duct epithelial cells. Intense 
and multifocal or diffuse HMGA2 immunoreactivity was noted in all the pancreatic 
carcinoma examined, A strong correlation between HMGA2 overexpression and the 
diagnosis of carcinoma was statistically verified. Based on these findings, we propose 
that an increased expression level of the HMGA2 protein is closely associated with the 
malignant phenotype in the pancreatic exocrine system, and accordingly, HMGA2 could 
serve as a potential diagnostic molecular marker for distinguishing pancreatic malignant 
cells from non-neoplastic pancreatic exocrine cells. 
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extracts. ]f these minor cell proteins differ among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression oflts Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capabie of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver lo increase the 
• production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond lo glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization— different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc : 
tive character. These features reflect. the persistent expression ol different sets of 
genes. 

Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 1 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are. many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is tianscribed (tran- 
scriptional control), (2) controlling how the primary RN A transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mfiNAs in the cell nucleus are exported to the cytoplasm (UNA transport con- 
trol), I -J) selecting which mKN/Ys in the cytoplasm are translated by ri bosom es 
ICnmslaiionnl control), (5) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules niter they have been 
made (protein activity control) (Figure 0-2).' 

For most genes transcriptional controls are paramount. This makes sense 
because, ol all the possible control points ill ma rated in Figure 0-2. only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
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Summary 

The many types of tells in animals ami plants ore created largely through mecha- 
nisms that cause different genes to he transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unitpte character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished. and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Vrocaryotes and yeasts provide unusually accessible model 
systems in which to study gene, regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher encuryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
Wins, each of which inhibits the synthesis of the other; this can create a flip-flop 
switch that switches a cell between two a I tentative patterns of gene, expression. Di- 
rect or indited positive feedback- loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In ettctn votes gene transcription is generally controlled by combinations of gene 
' regulatory proteins. Il is thought that each type of cell in a higher eucuryolic organism 
contains o specific combimaiim itfgene regiduiory proteins that ensures the expres- 
sion of only those genes appropriate to that type oj cell. A given gene regulatory pro- 
tein amy be expressed it} a variety of circumstances a tut lypictdly is involved in the 
regulation of many genes. 

In tuldition to diffusible gene regulatory proteins, inherited stales of chromatin 
condensation are also utilized by encaryotic cells to regulate gene expression. In ver- 
tebrates methyhuittn also plays a part, mainly as a device tit reinforce tlecisions 
nhtnn gene expression thai are ma tie initially by other mechanisms. 



Although i onirols on the initiation ol e,ene I t ansetipl ion ;nx the predominant 
lomi ol tv^nlaiion Jor mnsi i;<*nex oibei controls tan acl later in the pathway 
horn UNA to ptotein 10 modulate the amount ol gene produei thai is made. Al- 
though these post transcriptional controls, which opeiate alter RNA polymerase 
has bound to the gene's promoter ami hegnn KNA ■.ynthesis. ate less common 
than matscripiinnat control, loi many ernes (hey aie Oneial. It seems that every 
step in gene expulsion that t. on hi he eon l tolled in ptineiple is likely to he u'jm- 
faied under some eiicumsi;mces tor some series 

We e * mis id e i the varieties ol post I r ans*. > i phonal regulation in temporal 01- 
dei. aeeimlni:; to the semrenee e.t .vents drat mighi he experienced by an UNA 
molecule aliei its itan.-cripMon has begnn I riemr e '>-7^i. 
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Figure 6—3 Genes can be expressed 
with different efficiencies. Gene A is 
transcribed and translated much more 
efficiently than gene BThis allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in rending out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
J*NA nucleotide sequence. The information in RNA, although copied into .mother 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6^1). It differs from 
DNA chemically in two respects: (}) the nucleotides • in RNA are 
ribonucleotides— that is; they contain the sugar ribose (hence the name f ibomi- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine IG), and cytosine (C), it contains. the base uracil |U) 
instead of the thymine IT) in DNA. Since U, like X can base-pair by hydrogen 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters -I and 5 apply also to RNA (in RNA. G pairs with 
C, and A pairs with U). )t is not uncommon, however, to find other types ol ba.se 
pairs in RNA: foi example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single- stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have sum: final 
and catalytic functions. 

Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a < ell is made by D N A t r a n s c i i p i i o n , a p i oc e ss thai h ; i :, c e t - 
tain similarities to the piixes:. of DNA replication discussed in Cbapin 5» 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eocaryottc cell 
depends upon the efficiency of each step 
depicted 
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urc 6-00) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step — the transcription of its DNA sequence into an UNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mHNA molecule into protein takes 
place in the cytoplasm on a large ribonncleoprotein assembly called a ri bosom e. J he 
amino acids used for protein synthesis ore first attached to n family of tftNA 
molecules, each of which recognizes, by complementary base- pint interactions, par- 
ticular sets of three nucleotides in the ntUNA icodons). J' he sequence of nucleotides in 
the m.HNA is then read from one end to the other in sets of three nccortling to the 
genetic code. 

To initiate translation, a small ribosoma! subnnit binds to the at UNA molecule 
at a start rod on (AUG) that is recognized by a unique imtiniar tWMA molecule. A 
large tibosomal Sidntnit binds to complete the ribosonte ami begin the elongation 
phase of protein synthesis. During this phase, a mi none yl tRNAs—ench heating a 
specific amino acid bind sequentially to the appropriate cot I on in mKA/1 br forming 
complementary base pairs with the tBNA nnficotlon.'luich amino acid is added to the 
C- terminal end of the grooving polypeptide by ' means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic ger»e expressron can be 
controlled. Controls that operate at 
steps I through 5 are discussed in this 
chapter. Step 6. the regulation of protein 
activity, includes reversible activation or 
inacttvation by protein phosphorylation 
{discussed in Chapter 3) as well as 
irreversible inactrvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

if differences among the various cell types of an organism depend on The partic- 
ular genes that ihe cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, Ihere are many steps in ihe pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
2 cell can control the proteins it makes by (1) controlling when and.how often a 
given gene is transcribed {transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in I he cell nucleus are exported to thecytosol 
and determining where in the cytosol they are localized {UNA transport ami 
localization control), (^1) selecting which mRNAs in (he cytoplasm are translated 
by ribosomes ( transnational control), (5) selectively destabilizing certain in RNA 
molecules in the cytoplasm (mRNA degradation control), or (f>) selectively ;Kti- 
vnting, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure ?-!>). 

For most genes transcriptional controls are paramount. This makes "sense 
because, of all the possible control points illustrated in Figure 7-f>, only Iran 
script ional control ensures that the cell will not synthesize superfluous interme- 
diates. In Ihe following sections 'we discuss the DNA and protein components 
; ]iat perform this function by regulating the initiation of gene transcription. We 
-hall return at Ihe end of the chapter to the additional ways of regulating gene 
repression. 



nummary 

ihe genome of n cell contains in its UNA sequence the information to moke funny 

housands of different protein and UNA molecules. A cell typically expresses only a 
fraction of its genes, nnrf the different types of cells in multicellular organisms tnise 
-■-cause different sets of genes are expressed. Moreover, cells enn change the pattern 
; / genes they express in response to changes in their environment, snch as sigmds 

: cm other cells. Although all of the steps involved in expressing a gene can in prin- 
■. :nle be regulated, for most genes the initiation of RNA transcription is the most 

..portnnt point of control. 



" ax* does a cell determine which of iis thousands of genes lo transcribe":' As 
> rn honed briefly in Chapters -t and"K. the tianser iption of each gene is con- 
' iJed by a regulatory region of DNA tela lively near the site whore nanscription 
i -jn$. .Some icgulaioiy legions aie simple and act as switches that are t hi own 
single signal. Many otheis a a- compl«-\ and ;n. i as tiny microprocessors. 
F o n ding i o a var iety of signals that i h ■ * v r n t e rprei and integral e t o s \ v i t c h the 
- > : h b oiing gene on or o II. Wh e t h e t c « > m p 1 e.\ or simple, the se s w i t e h i n g < I e v i c e 
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We examined the effectiveness of various anti-tumour agents to natural killer (NK)-cel! 
tumour cell lines and samples, which are generally resistant to chemotherapy, using flow 
cytometric terminal deoxynucleotidyl trans fcrase-mediated dUTP-biotin nick end- 
labelling (TUNEL) assay. Although NK-YS and NK-92 were highly resistant to various 
anti-tumour agents, l-asparaginase induced apoptosis in these two NK-cell lines. NK-cell 
leukaenua/lymphoma and acute lymphoblastic leukaemia (ALL) samples, were . 
selectively sensitive to l-asparaginase and to doxorubicin (DXR) respectively. Samples of 
chronic NK lymphocytosis, an NK-cell disorder with an indolent clinical course, were 
resistant to both drugs. Our study clearly separated two major categories of NK-cell 
disorders and ALL according to the sensitivity to DXR and l-asparaginase. We examined 
asparagine synthetase levels by real-time quantitative polymerase chain reaction (RQ- 
PCR) and immunostaining in these samples. At least in nasal-type NK-cell lymphoma, 
there was a good correlation among asparagine synthetase expression, in vitro sensitivity 
and clinicalresponse to l-asparaginase. In aggressive NK-cell leukaemia, although 
asparagine synthetase expression was high at both mRNA and protein levels, l- 
asparaginase induced considerable apoptosis. Furthermore, samples of each disease entity 
occupied a distinct area in two-dimensional plotting with asparagine synthetase mRNA 
level (RQ-PCR) and in vitrol-asparaginase sensitivity (TUNEL assay). We confirmed 
rather specific anti-tumour activity of I-asparaginase against NK-cell tumours in vitro, 
which provides an experimental background to the clinical use of l-asparaginase for NK- 
cell tumours. 
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metaHoproteinase inhibitor-1 messenger-RNA and protein. 

A«st G t Hofmann-A, Laue S, RostA, Kohler T . Scherbaum WA . 
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Matrix metaHoproteinase- 1 (MMP-1) and tissue matrix metaHoproteinase inhibitor 1 
(TIMP-I) play an important role in remodeling the extracellular matrix in normal and 
pathological processes. The effect of phorbol-myristate acetate (PMA), interleukin-1 (1L- 
1), and rumor necrosis factor-alpha (TNF-aipha) on MMP-1 and TIMP-1 expression was 
studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 
1736 thyroid carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA 
(mRNA) levels were monitored by competitive semiquantitative reverse transcriptase 
polymerase chain reaction (RT-PCR) after 24 hours. Culture supernatants were assayed 
for free and/or complexed MMP-1 and TIMP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELISA) systems (detection limit: <2 ng/mL). MMP-1 and TIMP-1 
mRNA were present in all cell types, although thyrocytes showed MMP-1 mRNA levels 
near the detection limit. 8505 C expressed MMP-I mRNA levels of up to 10(6) times 
those of the other cells analyzed. PMA and IL-1 increased MMP-I mRNA in most cell . 
types. TIMP-1 mRNA increased after treatment with PMA in all cells except 8505 C, 
whereas only slight effects were shown after IL-1 stimulation. MMP-1 protein was- 
undetectable in normal thyrocyte cultures, but was secreted spontaneously by all cell 
lines ([ng/mL]; C 643:.15+/-7; HTh 74: 81+/-1; SW 1736: 13+/-2; 8505 C: 2097+A320). 
There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99, p 
< .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. 
Fibroblasts ([ng/mL] 517+/-55) and the cell lines (C 643: I42+/-48; HTh 74: 1 15+/-] 3* 
SW 1736: 202+/-J4; 8505C: 120+/- 19) secreted TIMP-1 in unstimulated cultures, 
whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated TIMP.-l secretion after 48 hours in SW 1736, HTh 74, and C 
643 cells. Our data suggest that in contrast to normal thyrocytes, dedifferentiated thyroid 
carcinoma cell lines are potential producers of MMP-1 as well asTJMP-1. High MMP-1 
of MMP-l/TIMP-1 expression may play a role in tissue invasion of undifferentiated 
thyroid cancer cells. 
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ABSTRACT 

Matrix metalloproteinase- 1 (MMP-1) and tissue matrix metalloproteinase inhibitor 1 (TTMP-l) play an Impor- 
tant role in remodeling the extracellular matrix in normal and pathological processes. The effect of phorboI-mynV 
tate acetate {PMA), interleukin-1 (EL-1), and tumor necrosis factors (TNF-<r) on MMP-1 and T1MP-1 expres- 
sion was studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 1736 thyroid 
carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA (mRNA) levels were monitored by 
competitive semiquantitative reverse transcriptase polymerase chain reaction (RT-PCR) after 24 hours. Culture 
supernatant* were assayed for free and/or complexed MMP-1 and T1MP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELISA) systems (detection limit: <2 ng/mL). MMP-1 and T1MP-1 mRNA were present in 
all cell types, although thyrocytes showed MMP-1 mRNA levels near the detection limit. 8505 C expressed MMP- 
1 mRNA levels of up to 10* times those of the other cells analyzed. PMA and BL-1 increased MMP-1 mRNA in 
most cell types. T1MP-1 mRNA increased after treatment with PMA in all cells except 8505 C, whereas only 
slight effects were shown after BL-1 stimulation. MMP-1 protein was undetectable in normal thyrocyte cultures, 
but was secreted spontaneously by all cell lines {[ng/mL]; C 643: 15 ± 7; HTh 74: 81 ±1; SW 1736: 13 + 2; 
8505C: 2097 ± 320). TTiere was a strong correlation between levels of MMP-1 mRNA and protein (i = 0.99, 
p < .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. Fibroblasts ([ng/mL] 517 ± 
55) and the cell lines {C 643: 142 ± 48; HTh 74: 115 ± 13; SW 1736: 202 ± 14; 8505 C: 120 ± 19) secreted 
nMP-1 in unstimulated cultures, whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated T1MP-1 secretion after 48 hours in SW 1736, HTh 74, and C 643 cells. Our data 
suggest that in contrast to normal thyrocytes, dedifferentiated thyroid carcinoma cell lines are potential produc- 
ers of MMP-1 as well as TIMP-1. High MMP-1 or MMP-l/lTMP-1 expression may play a role in tissue inva- 
sion of undifferentiated thyroid cancer cells. 



INTRODUCTION suppressed on external scimula tion, ie, with cytokines, 

Mphorbol 12-myristate 13-acetate (PMA), lipopolysaccaride 

ATRDt METALLOPROTEINASE5, (MMPs) constitute a (LPS), or retinoic acid (2,3). After secretion at post-tran- 

farmiy of structurally related proteolytic enzymes re- scriptional level, latent MMP proenzymes are regulated by 

sponsible for the proteolytic degradation of extracellular proteolytic activation and interaction with tissue inhibitors 

matrix (ECM) components. They are important partici- of matrix metalloproteinase {TIMPs), their specific in- 

pants m normal tissue remodeling and contribute to the hibitors. Any imbalance between the proteolytic MMPs ac- 

phenorype of several pathological conditions that are as- tivities and the TIMPs that could be influenced and caused 

sociaied with progressive ECM degradation. MMPs are by cytokines could potentially lead to pathological condi- 

highly regulated at different levels (1). At the transcrip- tions (4). 

tional level, MMP expression can be directly induced or MMP-1, although known as an interstitial collagenase, 
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is the only enzyme active at neutral pH that can degrade 
extracellular fibers comprised of collagen types I, II, and 
III. With this initial step, MMP-1 provides the cleavage 
products to other collagenase types (5). The major specific 
inhibitor of MMP-1 is TIMP-1, a 28.5-kd glycoprotein, 
which forms 1:1 stoichiometric complexes with the pro- 
tease (6). Cytokines and growth factors have been shown, 
to regulate the expression of both MMP-1 "and TIMP-1 
d>7>8). 

Although the participation of MMP-1 as the initial col- 
lagenase in tissue breakdown during tumor development is 
well documented (9-11), only one study has described the 
expression (12) but no study has as yet investigated the reg-: 
ulation of this enzyme in different thyroid rumors. Few stud- 
ies have been published investigating the role of other 
MMPs in normal and pathological thyroid tissue by in situ 
hybridization and immunohistochemistry (13-16). Fur- 
thermore, tissue remodeling includes both the action of 
MMPs and their inhibitors; thus, these enzymes could be 
involved in autoimmune and other nonautoimmune thyroid 
diseases during morphological changes (17,18). It is still un- 
known whether or not thyrocytes are able to express MMPs 
and TlMPs. Although type IV collagenases (MMP-2 and 
MMP-9) were detected in various human epithelial cells of 
different tissue origin (19,20), only one study described the 
secretion of MMP-1 by epithelial cells (21). 

Highly purified normal thyrocytes and four thyroid car- 
cinoma cell lines were included in this study to investigate 
the involvement of these cells in MMP-1 and TJMP-1 pro- 
duction during thyroid tissue remodeling processes and in 
malignant thyroid neoplasms. MMP-1 and TIMP-1 ex- 
pression were studied at both the mRNA and protein level 
by semiquantitative RT-PCR and EL1SA measurement, re- 
spectively. 

In unstimulated carcinoma cell lines both MMP-1 and 
TIMP-1 mRNA were expressed, partly at a high level, fol- 
lowed by the spontaneous secretion of the proteins. The 
various conditions for the stimulation of the different cell 
lines by cytokines and PMA were defined. In contrast to 
the cell lines, normal thyrocytes did not secrete MMP-1 
and only trace amount of TIMP-1, even after stimulation 
with PMA. 



MATERIALS AND METHODS 

Preparation of tissues, thyroid- derived cells, and 
cell lines 

Thyrocytes were prepared from surgical thyroid speci- 
mens from 3 pariems (1 Graves' disease, 2 nonioxic goi- 
ter; mean age 54.3 ± 5.0 years). Fibroblasts were separated 
from thyroid tissue of 5 other patients (3 Graves' disease, 
2 nontoxic goiter; mean age -13.6 ± 6.4 years). Graves' dis- 
ease and nontoxic goiter were diagnosed on the strengrh 
of clinical, biochemical, and immunologic features as well 
as thyroid scintiscans. 

Thyroid tissue was trimmed of fai and connective tissue 
immediately after surgery. Thyroid-derived cells were en- 
riched after gradual enzymatic digestion of tissue and cul- 
tured over a period of 16 hours as described. Thyrocytes 
were obtained from rhe adherent fiacrion by incubating 
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the cell monolayer with phosphate buffered saline (PBS) 
without Ca 2 ~/Mg 2+ for 45 minutes [22), Residual fibro- 
blasts were removed after subsequent incubation of the 
cells with the fibroblast-speciftc mab FibASOl (22) and 
goat-anti-mouse IgG-DYNABEADS(S> M45Q (DYNAL, 
Hamburg, Germany) according to the manufacturer's pro- 
tocol. 

Thyroid-derived fibroblasts were obtained after culturing 
small pieces of thyroid tissue in Dulbeccos's Modified Ea- 
gle's Medium (DMEM) with 10% fetal calf serum (FCS) 
, and harvested in the 5th to 7th passage. The purity of the 
thyrocytes and fibroblasts was determined by using indirect 
immunofluorescence technique on a FACS-Scan (Becton 
Dickinson GmbH, Heidelberg, Germany) as described (22). 

The following human anaplastic thyroid carcinoma cell 
lines were cultured in DMEM with 10% FCS: C 643 (23)- 
SW 1736 (23); and HTh 74 (24). The cell line 8505 C (25) 
was purchased from the German Collection of Microort 
ganisms and Animal Cell Cultures (DSM ACC219). This 
cell line was established from a primary thyroid tumor 
characterized histologically as a undifferentiated carcinoma 
that was partially composed of poorly differentiated pap- 
illary cells (25). This is a feature of a subgroup of anaplas- 
tic carcinoma (26). The majority of these coexistent better 
differentiated carcinoma foci in anaplastic carcinoma were 
papillary (26). . 

In vitro cultures 

Using 24- well plates, 1 X 10* cells were cultured for 24 
hours. The medium was aspirated and replaced with 500 
/iL OPTI-MEM (G1BCO BRL, Grand Island, NY) without 
FCS to eliminate possible stimulation of MMP-1 and 
TIMP-1 production by FCS. The medium contained the 
desired concentration of human IL-la (10 U/ml; Pepro 
Tech EC Ltd., London, UK), TNF-a (100 U/rnL; Pepro 
Tech EC Ltd.), interferon^ (IFN-y) (500 U/mL; Pepro 
Tech EC Ltd.), or 10 ng/mL PMA (SIGMA). 

Triplicate cultures of each stimulator were analyzed af- 
ter 3, 6, and 24 hours at the mRNA and after 24 and 48 
hours at the protein level. The supernatants were removed 
and stored at -80*C for further use, First,.a coll age nolytk 
assay based on the digestion of type I collagen was per- 
formed. This method showed direct evidence of free pro- 
MMP-I eniyme in the cell culture supernatants of un- 
stimulated and IL-lor stimulated 8505 C, HTh 74, and 
C634 cells (data not shown). However, the method does 
not allow quantitation of MMP-1 enzyme activity. Thus, 
the cell culture supernatants were assayed for MMP-1, 
TIMP-1, and MMP1/T1MP-1 complex by ELISA (Amer- 
sham Life Sciences, Braunschweig, Germany). The MMP- 
1 assay (sensitivity: 1.7 ng/mL) detected only total human 
MMP-1, ie, free MMP-1 and MMP-1 complexcd with in- 
hibitors such as TJMP-3. It did not detect MMP-1 bound 
by the nonspecific protease inhibitor a 2 macro$obvA\n* 
The MMP-l/TlMP-1 assay (sensitivity: 1.5 ng/mL) de- 
tected MMP-J/TIMP-3 complex, ie, activated,MMP-l thai 
has been subsequently complexed with the specific MMP- 
1 inhibitor TLMP-1. It did not detect free -active MMP-1, 
free TIMP-1, or pro-MMP-1. There was no cross-reactiv- 
iry with active MMP-1 bound by the nonspecific protease 
inhibitor (^microglobulin. The TIM P- 1 assay (sensitivity: 
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1.25 ng/mL) detected total human TIMP 1, ie, free TIMP- 
1 and that completed with MMPs. The assay did fully 
cross-react with TIMP-1 in complexes with other MMP. 
It did not cross-react with TIMP-2. 

RNA extraction and cDNA synthesis 

For gene expression studies, 5 mL RNAzolJ" B (Biotex 
Laboratories Inc., Houston, TX) was added to the cell cul- 
ture wells. The content of three wells of any cell type was 
pooled and then stored frozen for further mRN A analysis 
in liquid nitrogen. Total cellular RNA (cDNA) was iso- 
lated from the probes according to the manufacturer's pro- 
tocol. RNA was fractionated on a denaturing 1.0% agarose 
gel and stained with ethidium bromide to confirm that, 
spectrophotoroetric measurements were accurate and that 
the RNA had not been degraded. Five micrograms total 
RNA was reverse-transcribed to cDNA using the First- 
strand cDNA synthesis kit of Pharmacia (Uppsala, Swe- 
den) in a total reaction volume of 15 pL. 

mRNA analysis by competitive RT-PCR 

To correct for variations across different cDNA prepa- 
ration^ all samples were first adjusted to contain equal in- 
put gIyceralderyde-3-phosphate dehydrogenase (GAPDH) 
cDNA concentrations. Semi-quantitative GAPDH RT-PCR 
was used with a heterologeous synthetic competitor frag- 
ment. The generation of the specific PCR products from 
the competitor and the cDNA with the GAPDH primers 
were published earlier (22,27). 

We then estimated the MMP-1 and TIMP. 1 cDNA in 
these adjusted samples. The primers were selected using the 
DNAsis computer program (Hitachi Software Engineering 
Co, Yokohama, Japan). The primer pairs span one or more 
introns to allow unambiguous discrimination between 
cDNA and unwelcome contaminating genomic DNA. In 
quantifying MMP-1 and T1MP-1 cDNA, a rapid one-step 
method was introduced to synthesize an internal homolo- 
geous competitor (plan diagram of procedure: Fig. 1, ex- 
emplary for MMP-1 [28). A hybrid primer was synthesized 
(MMP-lhy) that consisted of two segments (segi, seg2). It 



MMPtl 



5* 



seg, 



PCR 



478 bp 



MMPlhy 1 MMPlr 

. : 82bp 
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FIG. 1. Genera) scheme for generating homologous con 
petitors used for quantitative PCR. 
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had a length of 40 nucleotides, in which 20 nucleotides (se gj ) 
at the 3' end corresponded to the opposite strand of the tar- 
get sequence a predetermined distance from primer MMP- 
If* and 20 nucleotides at the 5' end (segi = MMP-lr) that 
corresponded to the target sequence upstream from the seg- 
ment segj. Amplification with the primers MMP-lf and 
MMP-lhy from the cDNA resulted in a 478-base pair (bp) 
(polymerase chain reaction (PCR) product. It was freed from 
excess primers and deoxynucleoside-ttiphosphates (dNTPs) 
using the Qiaquick Gel Extraction Kit (Qiagen GmbH, 
Hilden, Germany) and quantified. A known number of 
copies of the competitor was introduced in the GAPDH-ad- 
justed samples and amplified with the primers MMP-li and 
MMP-lr. With this approach, two products were generated, 
one derived from the cDNA (560 bp) arid another, 82 bp 
smaller in size derived from the internal competitor (Fig. 1). 
PCR products were resolved by gel electrophoresis (1.5% 
agarose gel). The relative amounts of sample cDNA and 
competitor were quantified by measuring the intensity of 
ethidium fluorescence with a CCD image sensor and ana- 
lyzing the data with the EASY program (Herolab, Wiesloch, 
Germany). The initial amounts of sample cDNA and com- 
petitor were assumed to be equal in those reactions where 
the ratio of the two products was judged to be equal. This 
was expressed in arbitrary units (AU) (22,29). One AU was 
defined as the lowest concentration of competitoir yielding 
a detectable amplification product when added to PCR 
alone. For example, if equivalence between sample cDNA 
and competitor was reached using a 100-fold concentrated 
competitor the relative sample cDNA concentration was 100 
AU. Thyrocytes and the ceil lines were analyzed for the ex- 
pression of thyroid-specific and cytokine receptor mRN As 
in a simple RT-PCR. The sequences of the TPO and cy- 
tokine receptor primer pairs have been published by Wat- 
son et ah (30) and Tada et ai. (31) and gave the following 
product sizes: TPO: 506 bp; IL-1R type I (p80):300 bp; 1L- 
1R type D (p68): 392 bp; TNF-aR (p75): 324 bp; TNF-aR 
(p55): 587 bp and IFN-yR; B99 bp. The thyroglobulin (Tg) 
and thyroid stimulating hormone receptor (TSH-R) primer 
pairs were selected according to the published sequences us- 
ing the DNAsis program {Table 1); 

Each 25-uL amplification reaction contained 2.5 nX 
10 X concentrated PCR buffer (15 mM MgCI 2> Boehringer 
Mannheim, Germany), 0.3 U Taq DNA polymerase 
(Boehringer Mannheim, Germany), 100 /jjvi dNTPs 
(Perkin Elmer, Weiterstadt, Germany), 0.1 pM of each 
primer (1MB, Jena, Germany), and 1 cDNA and com- 
petitor in adjusted dilution. Furthermore, restriction map- 
ping (restriction enzymes: Boehringer Mannheim GmbH, 
Germany) was carried out to confirm the originality of the 
PCR product (Fig. 1, Table 1). 

Statistics • 

Protein levels of thyrocyte or fibroblast cultures from the 
different patients and of the thyroid carcinoma cell lines ob- 
tained from three separate experiments were presented as 
mean ± SEM values. Starisrical comparisons, between un- 
stimulated and stimulated cell cultures were performed by 
the alternate {Welch) t-test. The correlation berween basal 
mRNA levels and ihe unstimulated protein secretion in all 
cell types was calculaied according to the Spearman method. 
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Table 1. Primers. Length of Amplified Templates, Restriction Mapping and Assay CbNomoNs for RT-PCR 



Primer Length, of Length of Annealing , Number of 

5'* -J* cDNA (bp) competitor (bp) temperature cycles 



Tg 


forward 


GCAGATCTTACTGAGTGGCT 


416 




60 


35 


reverse 


TGTCAGCACAGTGCCAATAC 










TSH-R 


forward 


ACTTGCTGCAGGTGGTGCT 


354 




65 


35 


' exons 1-4 


reverse 


TGAGGGCATCAGGGTCTATG 










TSH-R 


forward 


GAAATTCGGAATACCAGGAACTTA ACT 


896 




53 ' ' 


35 


exons 4-10 


reverse 


AACTCATCGGACjTGGGGGTACA 










MMP-I 


forward 
reverse 
hybrid 


TGGGAGCAAACACATCTGAC 
ATCACrrCTCCGCGAATCGT 
ATCACTTCTCCCCGAATCGT 
CCATATATGGGTTGGATGCC 


560 


478 • 


64 


33 


TLMP-1 


forward 
reverse 
hybrid 


CTTAGGGGATGCCGCTGACA 
GGCAGGCAGGCAAGGTGACG 
GGCAGGCAGGCAAGGTGACG 
GGATGGATAAACAGGGAAAC 


351 


274 . 


64 


30 



Tg indicates thyroglobuline; TSH-R, thyroid stimulating hormone receptor, MMP-1, matrix roeiaHoproteiruise-I;~ TIMP-1, tissue inhibitor of 
mctalloproteinasc-1; bp, base pair. 



RESULTS 

Thyroid specific and. cytokine receptor 
mRNA expression 

Isolated thyrocytes as well as 8505 C cells expressed Tg 
and thyroperoxidase (TPO) mRNA, whereas transcripts of 
the TSH-R (exons 1-4, 354 bp, exons 4-10, 896 bp) were 
present only in the thyrocytes. The three anaplastic thy- 
roid carcinoma cell lines SW 1736\ C 634, and HTh 74 
were completely negative for the Tg, TPO, and TSH-R mR- 
NAs (Fig. 2). All cell lines and thyrocytes expressed 1L-1R 
(type I and type II), TNF-orR (p75 and p55) and IFN- r R 
mRNA (Fig. 2). 

Basal MMP-J and TIMP-1 mRNA and 
protein expression 

In most stimulation experiments, mRNA levels did not 
increase until 24 hours of incubation. The 24-hour mRNA 
levels are shown in Figures 3 and 4. The 3- and 6-hour 
levels are demonstrated in those experiments where the 
mRNA levels reached their peak before 24 hours of stim- 
ulation. If not otherwise indicated, the MMP-1 levels were 
measured using the ELISA system, which recognizes 
free/complexed MMP-1. 

'MMP-1 and TIMP-1 mRNA were found during un- 
stimulated culture in all investigated cell types, although 
the mRNA levels varied over a great range. 8505 C showed 
a basal MMP-1 mRNA level 20 times as high as those of 
the HTh 74 cells, 6 X JO 4 limes as high as C 643, and 2 X 
JO 6 times as high as SW 1736 cells. In thyrocytes, MMP- 
1 mRNA levels were found near ihe detection limit (Figs. 
3 and 4). 

Generally, when analyzing the noted cell types, the mea- 
sured basal MMP-1 or TIMP-1 mRNA levels correlated 
well with the basal protein expression (MMP-1: r = 0.99, 
p < ,000 1; TIMP-1 :r = 03$ y p< .002). Corresponding to 
the high MMP-1 mRNA level, 8505 C cells secreted ex- 
tremely high levels of MMP-.1. No MM P- 1 or TIMP-1 was 



detected in unstimulated thyrocyte cultures at any time- 
point examined. All other cell types showed a spontaneous 
MMP-1 and TIMP-1 secretion (Figs. 5 and 6). Thyroid- 
derived fibroblasts produced basal TIMP-1 levels of up to 
4 times higher in the four carcinoma cell lines, which se- 
creted nearly the same amounts of basal TIMP-1 protein. 
Nevertheless, TIMP-1 secretion of fibroblasts was found 
at lower levels than expected after TIMP-1 mRNA mea- 
surement in 4 of 5 analyzed patients. The results of the fi- 
broblast cultures from patient five showing a higher TIMP- 
1 expression than those from ihe 4 other patients (basal 
24 hour: 50 ± 2; PMA 24 hour: 90 ± 6 ng/mL TTMP-1) 
was omitted in Figure 6. 

Comparing the basal amount of free/complexed and 
TIMP-1 completed MMP-1 after 24 hours of stimulation, 
a significant level of MMP-1 was not complexetl with 
TIMP-1 in 8505 C cultures, whereas in fibroblast culrures 
most of the MMP-1 activity was inhibited by TIMP-1. The 
anaplastic carcinoma cell line HTh 74 did not show such 
a great discrepancy between free/complexed and TTMP-1 
complexed MMP-1 level as 8505 C cells (Fig. 7). 

Effects of lL-la on MMP-1 and TJMP-1 mRNA and 
protein expression 

Experiments were performed to determine whether hu- 
man thyroid epithelial cells and thyroid carcinoma cell lines 
could produce or increase basal MMP-1 and TTMP-1 se- 
cretion after exposure to various stimuli. The results from 
these stimulation experiments are summarized in Figures 
3 through 5. Generally, there was a delay in protein se- 
cretion level in comparison to the mRNA expression level. 
At the protein level, the cytokine-mediated stimulating or 
inhibiting effect is more distinct after 48 hours compared 
with 24 hours, even when the mRNA level had already de- 
creased after 6 hours. 

1L 1 upregulated MMP-1 mRNA in SW 1736>cells up 
to 100 times and, in thyroid-derived fibroblasts, up to 12 
times after 24 hours of incubation (Fig. 3). This increased 
mRNA level was accompanied by a significandy enhanced 
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TPO 
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TSH-R 
Exon 1- 4 

11-1 -R1 
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FIG. 2. Amplification of thyroid specific and interleukin-receptor mRNA in thyrocytes (1), SW 1736 (2), C 64 3 (3), HTh 
74 (4) and 8505 C {5) cells using RT-PCR; M, = 100-bp ladder (GIBCO). 



MMP-1 secretion after 48 hours. Furthermore, IL-la in- 
creased MMP-1 mRNA expression in thyrocytes up to 
seven times after 6 hours, but no" MMP-1 protein could be 
detected in thyrocyte cultures. 1L-1 had no stimulatory ef- 
fect on MMP-1 mRNA expression in C 643, HTh 74, and 
8505 C cells after 24 hours, although a significant increase 
of MMP-1 secretion was found in HTh-74 and SW 1736 
cells after 48 hours of incubation (Fig. 5). This discrepancy 
may be explained by a possible increase in MMP-1 mRNA 
level after 24 hours of stimulation. The S3me effect could 
also be observed in the 1L-1 stimulated T1MP-1 at the 
mRNA as well as the protein level: the only slight effect 
of 1L-1 on T1MP-1 mRNA expression. in carcinoma cell 
lines after 24 hours was accompanied by a significant in- 
crease of T1MP-1 secretion in 8505 C and HTh 74 cells 
after 48 hours (Figs. 4 and 6). 

Effects of TNF-a on both MMP-1/T1MP-1 mRNA 
and protein expression 

In contrast to IL-1, TNF-or did not stimulate the MMP- 
1 and TJMP-1 mRNA and protein levels in all carcinoma 
cell lines and thyrocytes. Only thyroid-derived fibroblasts 
responded with a slight upregulation of MMP- 1 and TIMP- 
1 mRNA expression after TNF-n stimulation, which was 
not accompanied by an increase of MMP-1 and JIMP 1 
secretion. 



Effects of PMA, and IFN-y on MMP-1 and TJMP1 
mRNA and protein expression 

PMA was included in our study as a positive control 
because it is known to upregulate or induce both MMP- 
I and TIMP-1 secretion in various cell types (1,32). In- 
deed, PMA was able to induce or enhance MMP-1 
mRNA levels in all cell types investigated, although the 
detected levels varied to a large extent (Fig. 3). This re- . 
suit is in good correlation with the significantly increased 
MMP-1 protein levels that were already detectable after 
24 hours of stimulation ( Fig. 5). PMA upregulared T1MP- 

1 mRNA levels by up to 20 times in C 643, and up to 

2 times in SW 1736 and HTh 74 cells, fibroblasts and 
thyrocytes, but it did not change the TiMP-1 mRNA con- 
tent in 8505 cells (Fig. 4). At the protein level, we found 
a significant stimulation of TIMP-1 secretion in C 643 
and HTh 74 cells, as well as in thyroid-derived fibrob- 
lasts (Fig. 6). 

In contrast to PMA, IFN--y was without effect on stim- 
ulation or downregulation of MMP-1 and TIMP-1 mRNA 
or protein in anv of the cell types investigated (Figs 5 and 
6). 

The main inhibitor of MMP-1 isTlMP-l, which forms 
1:1 stoichiometric complexes with MMP- 1, although some 
other inhibitors can also bind MMP-1. On the other hand, 
TIMP-1 can bind other MMP types. 
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FIG. 3. Representative samples of competitive amplified MMP-I mRNA of thyrocytes, thyroid-derived fibroblasts and thy- 
roid carcinoma cell lines without stimulation (control) and after stimulation with 10 U/mL IL-lcr and 100 U/mL TNF-a and 
10 ng/mL PMA after 24 hours. Serial dilutions of known amounts of the competitor fragment were coamplifjed with identi- 
cal aliquots of cDNA. The 560-bp (cDNA) and 478-bp (competitor) PCR products were visualized by agarose gel electrophoresis 
and ethidiumbrorriide staining. The relative concentration of the added competitor was given in arbitrary units (AU) in the 
figure. One AU was defined as the lowest concentration of the competitor yielding a detectable amplification for MMP-1 
mRNA. The ratio of competitor to cDNA fragments was determined by measuring the intensity of ethidium fluorescence with 
a CCD image sensor and analysis of data. Measured cDNA concentration can expressed in AIL 



DISCUSSION 

Our findings demonstrate for the first time that thyroid 
carcinoma cell lines are able to express MMP-1 and T1MP- 
1 mRNA and protein at significant levels in vitro. The ob- 
servation of spontaneous release of MMP-I and T1MP-1 
corresponds well with earlier studies covering the secretion 
of these proteins by several carcinoma cell lines (33,34). 

However, in conrrast to its clear physiological function 
in extracellular matrix breakdown, the role of MMP-J in 
tumor growth and metastases is still controversial 
(9-11,3.5). Recently, Murray et al. (10) demonstrated that 
MMP- ! is associated with poor prognosis in colorectal can- 
cer, and has a prognostic value independent of the Dukes 
stage. Therefore, MMP-J could be a target for therapeu- 
tic iotervenuon in such tumors. Furthermore, the hypoth- 
esis of whether or not cancer cells themselves arc able to 



produce MMP, or whether cancer cells stimulate the sur- 
rounding stromal cells to secrete MMP in vivo y is disputed. 
MMP-1 mRNA and protein were detected by both in situ 
hybridization and imrnunohistochemistry in stromal as 
well as tumor cells of head, neck, gastric, colorectal, and 
mammary carcinomas (9,10,36,37). In contrast, 
Kameyama (12) demonstrated by in situ hybridization that 
the MMP-1 mRNA was not expressed in the cancer cells 
but in the surrounding fibrous capsules of strongly differ- 
entiated papillary thyroid carcinoma tissue. Highly differ- 
entiated follicular carcinomas and follicular adenomas 
were depleted for MMP-1 transcripts. Undifferentiated fol- 
hculary, papillary, and aggressive anaplastic carcinomas 
that showed poor prognosis and strong tumor invasive and 
metastatic potential and that can be compared in their 
morphological, genetic and growth features with undiffer- 
entiated thvroid carcinoma cell lines were not included this 
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FIG 4. Competitive TLMP-1 mRNA RT-PCR yielding a 351-bp (cDNA) and a 274-bp (competitor) PCR product. For fur- 
ther details see Figure 3. 



study. However, the missing expression of MMP-1 by nor- 
mal thyrocytes and the spontaneous secretion of this pro- 
tein by highly malignant thyroid carcinoma cell lines, as 
demonstrated in our study, indicate the involvement of 
MMP-1 secretion of transformed thyrocytes in aggressive 
thyroid tumors. 

Although all cell lines analyzed in our study sponta- 
neously secreted MMP-1, we observed marked differences 
m the basal secretion capacity, the highest MMP-1 levels 
were determined in cultures of 8505 C cells. Only 8505 C 
cells expressed TPO and Tg mRNA that may be put down 
to residual differentiated components in the cell line (see 
Materials). However, none of the analyzed cell lines ex- 
pressed TSH-R mRNA. The cell population doubling times 
were less than 40 hours. AH cell lines had accumulations 
of multiple genetic events. These facts indicate the undif- 
ferentiated pathology of the studied lines. Jt is well known 
that anaplastic carcinoma cell lines well retain the malig- 
nant characterises of their parental tumors (38--40). 

Furthermore, we found a distorted proportion berween 
MMP-I and HMP-I rnRNA/pr otein for carcinoma cell 
lines but not for normal thyroid-derived fibroblasts. The 
most disadvantageous constellation berween MMP-1 and 
TIMP-1 was found in 8505 C cells. Similar to other stud- 
ies (41), these results suggest the influence of an altered 
MMPyTIMP relation on minor progiession. However, it 



should be mentioned that most studies, including the pre- 
sent one, do not take into consideration that a number of 
inhibitors distinct from TJMP-1 may regulate MMP-J ac- 
tivity. Taking into account that the balance of active en- 
zyme and TIMP-3 concentration strongly influence the ex- 
tent of local matrix degradation, a number of studies 
showed unexpectedly high levels of TIMP-1 in maJignant 
neoplasms (9,42,43). There is a great discussion as to 
whether the overall expression of MMP-1 and TIMP-1 or 
the amount of noncomplexed MMP-1 could be critical in 
aggressive tumor development. This fact underlines the na- 
ture of tissue breakdown, reflecting the complicated net- 
work of selective and coordinated production of individ- 
ual proteinases, and inhibitors under normal and 
pathophysiological conditions. Thus, the invasive and 
metastatic potential of thyroid tumors depends on the lo- 
cal net level of active MMPs. 

The synthesis of MMP-1 and TIMP-1 is influenced by a 
variety of biochemical stimuli. The recent findings on 
MMP-J and TIMP-1 gene promoters are useful in under- 
standing the complex mechanisms implied in the regula- 
tion of MMP synthesis modulated by cytokines ar\d tumor 
promoters (34,44,45). The promoter regions contain tu- 
mor promoter responsive elements (TRE) and binding mo- 
tifs for the transcription factor PEA-3, which are recog- 
nized by proio- oncogenic transcription factors, such as the 
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FIG. 7. Comparison between (i) free/complex and {») T1MP- 
1 completed MMP-1 levels, and (in) T1MP-1 levels in super- 
natants of unstimulated 8505 C and HTh 74 cells, and thy- 
roid-derived fibroblasts after 24 hours using a (i) MMP-t 
ELISA that recognizes total MMP-1 (see Figure 5). The (ii) 
MMP-l/nMM assay recognizes MMP-l/TIMP-1 complexes, 
ie, activated MMP-1 that has subsequently been complexed 
with the specific MMP inhibitor TIMP-1. The (hi) T1MP-1 
ELISA recognizes total "TIMP-1, ie, free TIMP-1 and that com- 
plexed with MMPs, 



fos and fun family (45-47). IL-1, TNF-a, and PMA up- 
regulate proto-oncogenes like fos and jun, resulting in the 
stimulation of MMP-1 and TIMP-1 (45,48). The action of 
the cytokines is mediated by their specific receptors. In our 
study, 1L-1R (type I and type II), TNF-aR (p75 and P 55) 
and IFN-t-R mRNAs were demonstrated in a J! investigated 
cell types. PMA and 1L-1 were shown to elevate MMP-1 
and T1MP-] in nearly all cell types investigated, thus con- 
firming the results of several studies on other epithelial cells 
(reviewed in refs. 1,7,49). In the majority of experiments, 
we found a concordant expression of MMP-1 and 71MP- 
1 after stimulation, possibly achieved by the coordinated 
actions of the nuclear transcription factors, although MMP- 
1 and BMP-1 expression can also be independently or even 
reciprocally regulated (1). The effect of TOE- a was not as 
distinct as in the case of PMA and IL-1, although several 
investigators found a pronounced effect of TNF-er particu- 
larly on T1MP-] secretion (4,34). In contrast to studies per- 
formed with other cell lypes (863,864,819), IFN-y did not 
mfluence MMP-1 and T1MP-1 expression in thyroid carci- 
noma cell hues. In summary, the involvement of the in- 
trathyroidal physiological and pathological cytokine mi- 
croenv>ronment in the regulation of MMP-1 and TJMP-1 
induction activarion and inhibition is strongly suggested. 

Furthermore, the data demonstrate thar regular human 
rhy/ocyrcs J,d not produce MMP ), even after powerful 
stimulation with PMA. Investigating other mammalian ep- 
ithehat cells, only one study revealed the production of 
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MMP-1 by rabbit corneal cells (21). It is yet not clear 
whether the MMP-1 mRNA detected in thyrocytes is due 
to a low level of constitutive transcription of the MMP-1 
gene {illegitimate transcription), an existing pool of stable 
MMP-1 mRNA, or. in vitro induction of MMP-1 mRNA. 
But h seems more likely that residua) fibroblasts contained 
in the purified thyrocyte preparation (< 0.2%) are respon- 
sible for the slightly positive RT-PCR results. Another ex- 
planation could be that thyrocytes are indeed MMP-1 pro- 
ducers, but the ELISA detection system used was not 
sensitive enough to measure extremely low MMP-1 secre- 
tion levels. Furthermore, the discrepancy between elevated 
T1MP-1 mRNA ievels pf thyrocytes and the extremely low 
T1MP-1 protein secretion by these cells is difficult to ex- 
plain. Posi-rxanscriptional regulatory events may be re- 
sponsible for this confounding result. 

Taken together, the present study suggests that the in- 
trathyroida! cytokine microenv iron men t is involved in the 
regulation of MMP-1 and its inhibitor HMP-1 in the thy- 
roid, and that both proteins may be secreted by dediffer- 
entiated thyroid carcinoma cells and involved in aggressive 
thyroid tumors in vivo. 
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Expression of embryonic fibronectin isoform EUFA parallels alpha-smooth 
muscle actin in maturing and diseased kidney. 

Barnes VL , Musa J . Mitchell RJ , Barnes JL . 

Department of Medicine, Division of Nephrology, University of Texas Health Science 
Center, San Antonio, Texas, USA. 

hi this study we examined if an association exists between expression of an alternatively 
spliced "embryonic" fibronectin isoform EIHA (Fn-EIHA) and alpha-smooth muscle 
actin (alpha-SMA) in the maturing and adult rat kidney and in two unrelated models of 
glomerular disease, passive accelerated anti -glomerular basement membrane (GBM) 
nephritis and Habu venom (HV)-induced proliferative glomerulonephritis, using 
immunohistochemistry and in situ hybridization. Fn-EHIA and alpha-SMA proteins were 
abundantly expressed in mesangium and in periglomerular and peritubular interstitium of 
20-day embryonic and 7-day (D-7) postnatal kidneys in regions of tubule and glomerular 
development Staining was markedly reduced in these structures in maturing juvenile (D- 
14) kidney and was largely lost in adult kidney. Expression of Fn-EHIA and alpha-SMA 
was reinitiated in the mesangium and the periglomerular and peritubular interstitium in 
both models and was also observed in glomerular crescents in anti-GBM nephritis. 
Increased expression of Fn-EIHA mRNA by in si tu hybridization corresponded to the 
localization of protein staining. Dual labeling experiments verified co-localization of Fn- 
EIHA and alpha-SMA, showing a strong correlation of staining between location and 
staining intensity during kidney development, maturation, and disease. Expression of 
EIHA mRNA corresponded to protein expression in developing and diseased kidneys and 
was lost in adult kidney These studies show a recapitulation of the co-expression of Fn- 
EIHA and alpha-SMA in anti-GBM disease and suggest a functional link for these two 
proteins. 
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BMI 1 ! geue amplification and overexpressiou ia hematological 
malignancies occur mainly iu mantle cell lymphomas. 

BcaS , Tort F. Pinyol rVt, Puig X . Hernandez L , Hernandez S , Fernandez PL , van 
Lohuizen M / Colomer D , Camno E . 

The Hematopathology Section, Laboratory of Anatomic Pathology, Hospital Clinic, 
[nstitut d f Investigacions Biomediques August Pi i Sunyer (IDIBAPS), University of 
Barcelona, Spain. 

The BMI-1 gene is a putative oncogene belonging to' the Polycomb group family that 
cooperates with c-myc in the generation of mouse lymphomas and seems to participate in 
cell cycle regulation and senescence by acting as a transcriptional repressor of the 
INfC4a/ARF locus. The BMI- 1 gene has been located on chromosome I Op 13 , a region 
involved in chromosomal translocations in infant ieukemias, and amplified in occasional 
non-Hodgkin's lymphomas (NHLs) and solid tumors. To determine the possible 
alterations of this gene in human malignancies, we have examined 1 60 
lymphoproliferative disorders, 13 myeloid Ieukemias, and 89 carcinomas by Southern 
blot analysis and detected BMI-1 gene amplification (3- to 7-fold) in 4 of 36 (1 1%) 
mantle cell lymphomas (MCLs) with no alterations in the INK4a/ARF locus. BMI-l and 
pI6INKL4a rnRNA and protein expression were also studied by real-time quantitative 
reverse transcription-PCR and Western blot, respectively, in a subset of NHLs. BMI-1 
expression was significantly higher in chronic lymphocytic leukemia and MCL than in 
follicular lymphoma and large B cell lymphoma. The four tumors with gene 
amplification showed significantly higher rnRNA levels than other MCLs and NHLs with 
the BMI- 1 gene in germline configuration. Five additional MCLs also showed very high 
rnRNA levels without gene amplification. A good correlation between BMI-1 rnRNA 
levels and protein expression was observed in all types of lymphomas. No relationship 
was detected between BMI-1 and p!6INK4a rnRNA levels. These findings suggest that 
BMI-l gene alterations in human neoplasms are uncommon, but they may contribute to 
die pathogenesis in a subset of malignant lymphomas, particularly of mantle cell type. 
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BMI-1 Gene Amplification and Overexpression in Hematological Malignancies 
Occur Mainly in Mantle Cell Lymphomas 1 

Silvia Bea, Frederic Tort, Magda Pinyo), Xavier Puig, Luis Hernandez, Silvia Hernandez, Pedro L. Fernandez, 
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Abstract 

■ The BMJ-1 gene is a putative oncogene belonging to the Polycomb 
group family that cooperates with c-myc in the generation of mouse 
lymphomas and seems to participate in cell cycle regulation and senes- 
cence by acting as a transcriptional repressor of the JNK4a/ARF locus. 
The BMJ-J gene has been located on chromosome 10pl3, a region involved 
in chromosomal translocations in infant leukemias, and amplified in* 
occasional non-Hodgldn's lymphomas (NHLs) and solid tumors. To de- 
termine the possible alterations of this gene in human malignancies, we 
have examined 160 rymph ©proliferative disorders, 13. myeloid leukemias, 
and 89 carcinomas by Southern Mot analysis and detected BMJ-1 gene 
amplification (3- to 7-fold) in 4 of 36 (11%) mantle cell lymphomas 
(MCLs) with no alterations in the INK 4 a/A RF locos. BMI-J and pi 6 ,NK4 * 
mRNA and protein expression were also studied by real-time quantitative 
reverse transcription-PCR and Western blot, respectively, in a subset of 
NHLs. BMJ-1 expression was significantly higher in chronic lymphocytic 
leukemia and MCL than in follicular lymphoma and large B cell lym- 
phoma. The four tumors with gene amplification showed significantly 
higher mRNA levels than other M.CLs and NHLs with the BMI-1 gene in 
germlinc configuration. Five additional JVJCLs also showed very high 
mRNA levels without gene amplification. A good correlation between 
BMJ-1 mRNA levels and protein expression was observed in all types of 
• lymphomas. No relationship was detected between BMJ-1 and pJ6 ,rvK4 * 
mnt^A levels. These findings suggest that BMJ-J gene alterations in 
human neoplasms are uncommon, but tbey may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 
type. 

Introduction 

The BMJ-J 3 gene is a putative oncogene of the Polycomb group 
originally identified by retroviral insertions! mutagenesis in Ejj.-c- 
myc transgenic mice infected with the Moloney murine leukemia 
vims (1, 2). These animals had a rapid development of pre-B cell 
lymphomas showing frequent proviral insertions near the BMJ-1 gene. 
This integration resulted in BMI-J overexpression suggesting a coop- 
erative effect between C-MYC and BMI-J genes in the development of 
these rumors (3, 4). Recent snidies have indicated that the BMJ-1 gene 
may also participate in cell cycle control and senescence through the 
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JNK4a/ARF locus by acting as an upstream negative regulator of 
p)6 iNK4a aj)d pJ 4/ p , 9 ARF gene expression (5). The human BMJ-J 
gene has been mapped to chromosome 10pl3 (6), a region involved in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant X cell lymphomas (8, 9). More recently, high-level 
DNA amplifications of this region have been found by comparative 
genomic hybridization in NHLs and solid tumors (10, 11). However, 
the possible implication of the BMI-J gene in these alterations and its 
role in the pathogenesis of human rumors is not known. The aim of 
this study was to analyze- the possible BMJ-J gene alterations and 
expression in a large series of human neoplasms and to determine the ' 
relationship with JNK4a/ARF locus aberrations. 

Materials and Methods 

Case Selection. A series of 262 human tumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table 1), matched normal tissues from 
all carcinomas, J 1 samples of normal peripheral mononuclear cells, and 5 
reactive lymph nodes and tonsils, were selected based on the availability of 
frozen samples- for molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNase treatment; 15 fig were digested with EcoKl 
and //mdlll restriction enzymes (Life Technologies, Inc., Gaithersburg, MD), 
for Southern blot analysis and hybridized with a 1.5-kb Pstl fragment of the 
partial BMl-l cDNA (6). 

RNA Extraction and BeaKriroe Quantitative JtT-PCK. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs) using guanidine/isothiocyanate extraction and cesium/chloride gradient 
centrifugal ion. One u.g of total RN A was transcribed into cDNA using 
MMLV-reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of the BM11 and the pi 6 
detection probes and primers were designed using (be Primer Express program 
(Applied Biosystems, Foster City) as follows: BMJ-J sense, 5'-CTGGTTGC- 
CCATTGACAGC-3'; .BMJ-J antisense, 5-CAGAAAATGAATGCGAG- 
CCA-3': pi 6 sense, 5 '-CA ACGCACCG A ATAGTTACGG-3'; pi 6 antiserise, 
5'-AACTTCGTCCTCCAGACTCGC-3'. The probes BMI-1, 5'-CAGCTC- 
GCTTCAAGATGGCCGC-3', and P 16 t 5'-CGGACGCCGATCCAOGTGG- 
GTA-3\ were labeled with 6-carboxy 7 fluorescem as the reporter dye. The 
TaqMon-GAPDH Control Reagents (Applied Biosystems) were used to am- 
plify and detect the GAPDH gene, as recommended by the manufacturer. The 
quantitative assay amplified 1 pim) of cDNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosystems). All reactions were performed in an AB1 PRISM 7700 Sequence 
Detector System (Applied Biosystems). GAPD1L BMI-1, and plG™* 4 " ex- 
pression was related to a standard curv e derived from serial dilutions of Raji 
cDNA. The RUs of BMI-1 and p)^™ 4 ' expression were defined as the 
mRNA levels of these genes normalized to (he GADPU expression level in 
each case. 

Protein Analysis. Whole-cell protein extracts were obtained from addi- 
tional frozen tissue available in 31 cases (7 CLLs. 12 MCLs, 8 FLs, and A 
LCLs) r londcd onto n J0% SDS-polyacry (amide £eL and eJectroMotied to a 
nitrocellulose membrane (Amcrsham). Blocked membranes were incubated 
sequentially with the monoclonal antibody BMI-F6 (J?), antimouse conju- 
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Tabic I 



Hematological malignancies and solid tumor samples analyzed for BMJl 
gene alterations 



Tissue samples 


No. of cases 


'u. r - 
rteniatological rnsl) tenancies 




HcxipkiD's disease 


2 


B cell lymphoprolifdativc disorders 




B-Acutc lymphoblastic leukemia 


14 


. CLL 


29 


Hairy cell leukemia 


4 


FL 


15 


MCL 


36 


LCL 


40 


T cell lympboprojifcrative disorders 




T-Aculc lymphoblastic leukemia 


8 


large granular cell leukemia 


4 


Peripheral T-cell lymphoma 


8 


Myeloproliferative disorders 




Acute myeloid leukemia 


7 


Chronic myeloid leukemia 


6 . 


Solid tumors 




Colon carcinoma 


26 


Breast carcinoma 


29 


Laryngeal squamous cell carcinoma 


34 


Total 


262 ■ 



gated to horseradish peroxidase (Amersbam), and detected by enhanced chemi- 
luminescence (Amersham) according to the manufacturer's recommendations. 

Statistical Analysis. Because of the non-normal distribution of the samples 
and the small. size of some subsets of rumors, the statistical evaluation was 
. performed using nonparametric tests (SPSS, version 9.0), Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
the Kruskal-WaJlis Test, with a P for significance set at 0.05. For differences 
between particular groups, the conservative Bonferroni procedure was per- 
formed, and the P was set at 0.005. The remaining statistical analyses were 
carried out using the Mann- Whitney nonparametric U test (significance, P 
<0.05). The comparison between BMM and pie^ 4 ' quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

Results 

BMJ-J Gene Amplification. The BMJ-J gene was examined by 
Southern bJot in a large series of human tumors and normal samples 
(Table I). The cDNA probe used in the study detected three £coK\ 
fragments of 7.3, 3:8, and 2.6 kb and three //mdlTI fragments of 6.2, 
4, and 3.5 kb. BMJ-J gene amplification (3- to 7- fold) was detected in 
A of 36 (11%) MCLs (Fig. 1). The amplifications were confirmed with 
both restriction enzymes. The amplified MCLs were two bJastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with their respective matched non- 
neoplastic mucosa. No BMJ-J gene rearrangements were observed in 
any of the samples examined. 

BIVH-J mRNA Expression. To determine the BMM expression 
pattern in NHL we analyzed BMM mRNA levels by real-time quan- 
titative RT-PCR in 67 lymphomas ( 1 0 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BML 1 mRNA expression pattern was observed in the different types 
of lymphomas (Fig. 2; Kmskal-Wallis Test; P < 0.001). The BM) 
mRNA levels m CLLs (mean, 2.2 RU; SD, L3) and MCLs with no 
BMJ-J gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6 RU; 
SD, 0.4; Mann- Whitney nonparametric V test; P < 0.01). The 4 
MCLs with BMJ-J gene amplification showed significantly higher 
levels of expression than all other groups of mmors (mean, 5.) RU; 
SD, 1.6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of the gene also showed very high levels of BML I mRNA 
cxpicssion ranging from 4 to 9.8 RU, similar to cases with gene 
amplification (Fig. Z4). 

BMM Protein Expression. BMM protein expression was exam- 
ined by Western blot in 31 mmors (7 CLLs; J 7 MCLs. including two 
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rases with BMJ-J gene amplification and 4 cases with mRNA over- 
expression and no structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody against BMM detected three closely migrating proteins of 
M r 45,000-48,000 (2). The two more slowly migrating bands prob- 
ably represent phosphorylated isoforms of the protein (12). The two 
MCLs with gene amplification and three of four cases with mRNA 
overexpression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed . 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that .BMM protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMM and pl6 ,NK4a Gene Alterations. 

The JNK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of the BMJ-J gene, 
suggesting that this gene may contribute to human neoplasias with 
wild type JNK4/AJIF (5). Most of the lyrophoproliferative disorders 
analyzed in the present study, including the four cases with BMJ-J 
gene amplification, had been previously examined for p53 gene mu- 
tations and JNK4a/ARF locus alterations, including gene deletions, 
mutations, hypermethylation, and expression (13, 14). The four MCLs 
with BMJ-J gene amplification and mRNA overexpression and the. 
five tumors with BMM mRNA overexpression with no structural 
alterations of the gene showed a wild-type configuration of the 
JNK4a/ARF locus (13). However, one case with BMJ-J gene ampli- 
fication and one case with mRNA overexpression with no alteration of 
the gene showed p53 gene mutations associated with allelic deletions. 

To determine the possible relationship between BMM and 
pl6 ,NK4a mRNA expression, p}6™ KA * mRNA levels were evaluated 
by real- time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs. 
and 13 LCLs), including 6 cases with alterations in the JNK4a/ARF 
locus (2 MCLs and 1 LCL with pJd™ 4 * gene deletion, 2 LCLs with 
pJ6 promoter hypermethylation. and 1 CLL with pJ6 iriKAa gene 
mutation), and the 4 lymphomas with BMJ^J amplification: Negative 
or negligible levels of pJ 6 J>,K4:> were observed in the 6 rumors with 
JNK4o/ARF locus alterations. These cases were not included in the 
comparisons between BMM and pl6 INK4a mRNA expression. The 
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expression levels were relatively similar in the different 



types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach statistical significance (Fig. 2B). No 
differences were observed in the pie™* 4 * mRNA levels between 
tumors with BMJ-J gene amplification and overexpression and lym- 
phomas with germline configuration of the gene. 
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Fig. I. Southern bloi analysis of BM! ) t 'ciw. Foui MCU (MCL*) showed BMLl gene 
amplifies* ion (3- 10 7- fori!) compared wiih non neoplastic tissues (/v)aod other NHLs! No 
amplifications 01 gene rcansn^mnii-. were detected in I he remaining NHLs and carci- 
nomas included in the study. 
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Fig. 2. A, quantitative BMM mRNA transcript analysis (median and range) using 
rtaMime RT-PCR in a scries of NHLs. MCLs with BW-J gene amplification (MCI*) 
revealed significantly higher overall BMJ-J mRNA levels than alt other types of NHLs. 
including MCLs with no structural alterations of the gene (P < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs (P < 0.01) J). Results are depicted 
as the ratio of absolute BMM:GADPH mRNA transcript numbers (RU). Bars, SD. B r 
quantitative pl6 ,NK4 " mRNA transcript analysis (median and range) using real-time 
KT-PCR in a series of NHLs. Expression levels were relatively similar in the different 
types of rumors. Results are depicted as the ratio of absolute pI6 ,NK4, *:GADPH mRNA 
transcript numbers (RU). Borj, SD. 



Discussion 

In the present study, we have examined a large series of human 
tumors for the presence of gene alterations and mRNA expression of 
the BMJ-J gene. Gene amplification was identified in four MCLs. 
These tumors showed significantly higher levels of mRNA and pro- 
tein expression compared with other lymphomas with BMJ-J in germ- 
line configuration. BMM expression levels were also highly up- 
rcgulated in a subset of MCLs with no apparent structural alterations 
of the gene. No alterations were detected in any of the different types 
of carcinomas included in the study. BMJ-J is considered an oncogene 
belonging to the Polycomb group family of genes. These proteins 
mainly act as transcriptional regulators, controlling specific target 
genes involved in development, cell differentiation, proliferation, and 
senescence. Different studies have shown the implication of BM1- 1 
overexprcssion in the development of lymphomas in murine and 
feline animal models (3. 4). The findings of the present-study indicate 
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for the first time that BMJ-J gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type. 

The human BMJ-J gene has been mapped to chromosome J Op 13. 
High-level DNA amplifications and gains in this region have been 
identified by comparative genomic hybridization in occasional solid 
rumors and NHLs (10, 1J). Different chromosomal translocations 
involving the I Op J 3 region have also been identified in infant leuke- 
mias and T cell lymphoproJiferative disorders (7, 8, 15). Most acute 
Jeukemias with this chromosomal alteration occur in children <12 
months of age, whereas it seems to be extremely rare in adults. lOp 
translocations in T-cell lymphoproliferative disorders have been ob- 
served mainly in adult T cell leuJcemia/Jymphomas and occasional 
cutaneous T cell lymphomas. In our study, we did not observe BMJ-J 
. rearrangements or amplifications in any of the acute Jeukemias or T 
cell lymphomas. However, all of the acute Jeukemias in this study 
were diagnosed in patients over ) 6 years, and no adult T cell leuke- 
mia/lymphomas or cutaneous lymphomas could be included in the 
series. Similarly. high-level DNA amplifications at the 1 Op 13 region 
have been detected in head and neck carcinomas and other solid 
rumors. Although we found no. evidence for BMJ-J gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these rumors. Additional studies are required to elucidate the 
possible involvement of BMJ-J in these particular groups of human 
neoplasms. 

In human hematopoietic cells, BMI-I is preferentially expressed in 
primitive CD34 + bone marrow cells, whereas it is negative or very 
Jow in more mature CD34- cells (16). Jn peripheral lymphocytes, and 
particularly in follicular. B cells, BM1-1 protein expression has been 
detected in resting cells of the mantle zone, whereas it is down- 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BM1-) expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow 
and antigen-specific response in peripheral lymphocytes. BMM ex- 
pression in human tumors has not been examined previously. In this 
study, we have demonstrated that BM)- 1 mRNA and protein expres- 
sion show a distinct pattern in different types of lymphomas. Thus, 
BMJ- 1 levels were low in LCLs and FLs and significantly higher in 
MCLs and CLLs. These findings suggest that BM1-1 expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their normal cell counterparts; because FLs and at least a subgroup 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are.tumors mainly derived from . 
naive pregerminal center cells. However, the four MCLs with BMJ-J 
gene amplification expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations' 
of the gene showed high mRNA levels similar to those observed in 
tumors wiih BMJ-J gene amplification, suggesting that other mecha- 
nisms may be involved in up- regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMJ- J gene expression on skeleton development 
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rt£. 3. >VtMt in Mot analysis of BMM prorcin in NHLs. The amplified MCL (17624) 
shemeu* the higtirii BMI- ) protein levels, whereas other MCLs ami CLLs had intrirncdislr 
levels of o press ion. Very km in negative signal was observed in FLs ;.nd LCLs. 
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and lymphomagenesis (K 3). These observations suggest that the high 
mRNA and protein levels detected in a subset of MCLs may play a 
role in the pathogenesis of these neoplasms. 

Recent studies have identified the JNK4/ARF locus as a down- 
stream target of the BMI-l transcriptional repressor activity, suggest- 
ing that BMM overexpression may contribute to human neoplasias 
that retain the wild-type JNK4a/ARF locus (5). Interestingly, in our 
study, BMJ-J amplification and overexpression appeared in tumors 
with no alterations in pJd 1 *** 4 * and ^74^ genes. However, we could 
not detect differences in the expression levels of pJ6 nS,K4i in tumors 
with arid without BMJ-J gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than JNK4afARF are the main targets of 
BMM repressor activity in these rumors. Particularly, different genes 
of the HOX family are regulated by BMM and may also be involved 
in lymphomagenesis (19, 20). 

Jn conclusion, the findings of this study indicate that BMJ-J gene 
. expression is differentially regulated in B cell lymphomas. Alterations 
of the gene seem to be an uncommon phenomenon in human neo- 
plasms, but they may contribute to the pathogenesis in a subset of 
MCLs. Although, BMJ-J gene alterations occurred in tumors with 
wild-type JNK4a/ARF Jocus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMM in human neoplasms 
require additional analysis. 
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Histopathology is insufficient to predict disease progression and clinical outcome in lung adeno- 
carcinoma. Here we show that gene-expression profiles based on mkroarray analysis can be 
used to predict patient survival in early-stage lung adenocarcinomas. Genes most related to sur- 
vival were identified with univariate Cox analysis. Using either two equivalent but independent 
training and testing sets, or 'leave-one- out' cross-validation analysis with all tumors, a risk index 
based on the top 50 genes identified low-risk and high-risk stage I lung adenocarcinomas, which 
differed significantly with respect to survival. This risk index was then validated using an inde- 
pendent sample of lung adenocarcinomas that predicted high- and low-risk groups. This index 
included genes not previously associated with survival. The identification of a set of genes that 
predict survival in early-stage lung adenocarcinoma allows delineation of a high-risk group that 
may benefit from adjuvant therapy. 



Lung cancer remains the leading cause of cancer death in indus- 
trialized countries. Most patients with non-small cell lung can- 
cer (NSCLC) piesent with advanced disease, and despite recent 
advances in multi-modality therapy, the overall 10-yeaj survival 
rate remains a dismal 8- 30%'. However, a significant minority of 
patients (-25-30%) with NSCLC have stage I disease and receive 
surgical intervention alone. Although 35-50% of patients with 
stage I disease will relapse within 5 years 7 \ it is not currently 
possible to identify specific high-risk patients. 

Adenocarcinoma is currently the predominant histological 
subtype of NSCLC {refs. 1,5,6). Although morphological assess- 
ment of lung carcinomas can roughly stratify patients, there is a 
need to identify patients at high risk for recurrent or metastatic 
disease. Preoperative variables that aflect survival of patients 
with NSCLC have been identified 7 " 10 . Tumor size, vascular inva- 
sion, poor differentiation, high tumor-proliferative index and 
several genetic alterations, including K-ras (refs. 11,] 2) and p53 
(refs. 10,13) mutations, have prognostic significance. Multiple 
independently assessed genes oi gene products have also been 
investigated to better predict patient prognosis in lung can- 
cer' 4 ,f . Technologies that simultaneously analyze the expression 
of thousands of genes' 9 can be used to correlate gene-expicssioir 
patterns with numerous clinical parameters— including patient 
outcome— to better piedirt tumor behavior in individual pa- 
tients™ Analyses of lung cancers using array technologies have 
identified subgroups of tumors thai differ according to tumor 
type and histological subclasses and, to a lesser extent, suiviv.-il 
among adenocarcinoma patienls :i:7 . Heie we correlated gene- 
ex pi essi on profiles with clinical outcome in a cohort of patients 
with lung adenocai cinema and identified specific genes that 



predict survival among patients with stage 1 disease. For further 
validation, we also show that the risk index predicted survival in 
an independent cohort of stage I lung adenocarcinomas. 

Hierarchical profile clustering yields three tumor subsets 
Using oligonucleotide anays, we generated gene-expression pro- 
files for 86 primary lung adenocarcinomas, including 67 stage I 
and 19 stage III tumors, as well as 10 non neoplastic lung sam- 
ples. Selected sample replicates showed high correlation among 
coefficients and reliable reproducibility. We determined tran- 
script abundance using a custom algorithm and the data set was 
trimmed of genes expressed at extremely low levels, that is, 
genes were excluded if the measure of their 75th percentile value 
was less than 100. Although potentially resulting in the loss of 
some information, trimming in this manner decreased the possi- 
bility that the clustering algorithm would be strongly influenced 
by genes with lit lie or no expression in these samples. 
Hierarchical clustering with the resulting 4,966 genes yielded 3 
clusters of tumors (Fig. 3). All 10 non-neoplastic samples clus- 
tered tightly together within Cluster 1 (data not shown). We ex- 
amined the relationships between cluster and patient and tumor 
characteristics {Fig. 1 and Supplementary Figure A online). There 
were associations between cluster and stage [P - 0.030) and be- 
tween cluster and differentiation {P= 0.0J). Cluster 1 contained 
I he greniesl percentage ('12.8%) of well differentiated tumors, 
followed by Cluster 2 (27%) and Cluster 3 (4.7%). Cluster 3 con- 
la in ed the highest percentage of both poorly differentia ted 
N?.6%) and stage 111 tumors (-12.8%), yet contained .3 fM.3%) 
moderately differentiated ami 3 (5%) well dif leientiaied si age 1 
tumor. Notably. II stage I turnots were piesent in Cluster 3, sug- 
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gesting a common gene- expression profile for 
this subset of stage I and stage III tumors. 

For patients with stage ) and stage III tumors, 
the average ages were 68. J and 64.5 years and 
the percentage of smokers was 88.9% and 
89.5%, respectively. Marginally significant as- 
sociations between cluster and smoking his- 
tory were observed {P = 0.06). A significant 
relationship between histopathological classifi- 
cation and cluster was only .discemable for 
bronchioloalveolar adenocarcinomas {BAs), 
which were only present in Clusters 1 and 2 
(P = 0.0055) and comprised 35.7% and 12.3% 
of tumors for Clusters 1 and 2, respectively. 

We examined the heterogeneity in gene-ex- 
pression profiles based on the trimmed data. set among normal 
lung samples and stage 1 and stage 111 adenocarcinomas by calcu- 
lating correlation coefficients between all pairs of samples. In 
contrast to normal lung samples that displayed highly similar 
gene-expression profiles (median correlation, 0.9), both stage 1 
and 111 lung tumors demonstrated much greater heterogeneity in 
their expression profiles with lower correlation coefficients (me- 
dian values, 0.82 and 0.79, respectively). 

Northern-blot and immunohistochemistry analyses 
Of the 4,966 genes examined, 967 differed significantly between 
stage I and 111 adenocarcinomas, a number in excess of that ex- 
pected by chance alone (248 at alpha level (a) = 0.05). Three 
genes were arbitrarily selected to verify the microarray expression 
data. The mRNA from 20 of the normal lung and tumor samples 
was examined by northern-blot hybridization with probes for in- 
sulin like growth factor-binding protein 3 {JGFBP3), cystatin C 



Cluster I 



Ouster 2 



Ouster 3 



iff 



Fig. 1 Unsupervised classification analysis of lung adenocarcinomas. 3 classes of tumors identi- 
fied by agglometauve hierarchical clustering of gene- expression profiles using the 4;966 expressed 
genes. Patient and histopathological information for each lung adenocarcinoma case by cluster 
designation and methods for K-roj 12/13th-codon mutational status and nuclear p53 protein ac-. 
cumulation are provided (Supplementary Figure A online). IN classification denotes information 
regarding patient tumor size and nodal involvement. Associations between cluster membership 
and patient or histopathological variables are indicated at significance level {P< 0.05). 



arid lactate dehydrogenase A (LDH-A) (Fig. 2a). Two gene probes 
not represented on the mi cr oar jays were used as controls, includ- 
ing histone H4, a potential index of overalfcell proliferation, and 
28S libosomal RNA, a control for sample loading and transfer. 
The relative amounts of 1GFBP3, cystatin C and Z.DH-A mRNA 
strongly correlated with micr oar ray-based measurements (Fig. 
2b). In both assays, JGFBP3 and LDH-A mRNA levels increased 
from stage 1 to stage 111 adenocarcinomas and were higher than 
those in normal lung. Cystatin C mRNA levels were more variable 
but relatively greater in normal lung than tumors. These results 
suggest that the oligonucleotide microariays provided reliable 
measures of gene expression. The tumors showed slightly greater 
histone H4 expression than the normal lung, likely reflecting in- 
creased proliferation of tumor cells. 

Immunohistochemistry was pejformed lor 1GFBP3, cystatin C 
and MSP- 70 to determine whether mRNA over expression was re- 
flected by an increase of their coriesponding proteins in tumors. 
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tig. 7 Validation analyses of gene-e*pies- 
sion profiting. «, Northem-biot analysis ot 
selected candidate genes foi verification- of 
data obtained from oligonucleotide arrays, 
"the same sample RNA tor the 4 uninvolved 
lung, 8 stage I and 8 stage HI tumors was 

uied lor the nor ther n-blot and oligonucleotide array analyses, 
b. Conf lation analysis of quantitative data obtained from oligonucleotide 
arrays and noil I .em blots measured by integrated phosphor imager -based 
signals tor the \CJBP3 and IDH-A genes. the ratio ot iGfBP?, cystatin C 
and / 01 f A mRNA to ?8$ rRNA was determined. The relative value.*, loi 
eat h gene from each sample are shown. t\ non- neoplastic nor r rial luno; 
1. staoe I tumors; 3, stage MI tumors. f r Immimohislochemk al analysis ol 
ICIBP-jf, HhV-70 and cystatin C in lung and lung ad e not .an in or n as. 
Cytoplasmic. IGIBI , -3 immun&rpactivity in a neoplastic gland (tumor t 77) 
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IGFBP3 



HSP-70 




Cystalin C 



Cystatin C 



with prominent apical staining (blue reattant staining, arrow, upper letl). 
DiMuse cytoplasmic HSP-70 immimoreac tivity (tumor 177), yet stromal el- 
ements show no i eoc tivity (upper right). Normal lui>g parenchyma (lower 
lelt) shows cytoplasmic cystatin C immunoreai tivity in alveolar pneumo- 
(ytes (arrow) and intra- alveolar mac rophages but tumor (190) shows dif- 
fuse cytoplasmic cystatin C imieumoreitt tivity with prominent apical 
stainrnq (lower right). Magnification, > 700 




20 40 feO 80 )00 o 2 0 <0 so ^ t6o 

Time to death (months) - - ' . 

Jmmunoieactivity for both 1GFDP-3 and H$P-70 {Ftg. 2c) was de- 
tected in the cytoplasm of the adenocarcinomas, with little de- 
tectable reactivity in the stromal ox inflammatory cells. Cystatin 
C was detected in alveolar pneumocytes and intra-alveolar 
macrophages in non neoplastic lung paienchyma and also con- 
sistently in the cytoplasm of neoplastic cells. 

Cene-expression profiles predict survival 

As expected, Kaplan-Meier survival curve* (Fig. 3a) and log-rank 
tests indicated poorer survival among stage 111 compared with 
stage 1 adenocarcinomas (P = <0.000l). Two statistical ap- 
proaches were used to determine whether gene-expression pro- 
files could piedict survival using the data set of 4,966 genes. In 
one approach, equal numbers of randomly assigned stage I and 
stage If! tumors constituted training Or = 13) and testing (;r = 43) 
sets. In the training set, the top 10, 20, SO or 75 genes were used 
to create risk indices that were evaluated for their association 
with survival- using the 50th, 60th or 70th percentile cutoff 
points to categorize patients into high or low groups. The results 
were similar across cutoff points but the 50- gene risk index had 
I he best overall association with survival in the training set. 



Fig. 3 Gene-expression profiles and patient survival, a. Relationship be- 
tween tumor stage and patient survival (stage 1 and stage 3 differ signifi- 
cantly, P< 0.0001). b. Relationship between the survival. in the 43 test 
samples and their risk assignments based on the 50-gene risk index esti- 
mated in the 43 training samples. The high- and low-risk groups differ sig- 
nificantly (P = 0.024). c, Relationship between patient survival and the risk 
assignments in test samples (in b) conditional for tumor stage. The high- ■ 
and low-risk stage I groups differ significantly (P = 0.02B), whereas stage IH 
low- and high-risk groups did not (P= 0.634). rf. Relationship between sur- 
vival in the test cases and their risk assignments based on the 86 leave-one- 
out' cross-validation of the 50-gene risk index. The high- and low-risk 
groups differ significantly (P= 0.0006). r, Relationship between test case's 
risk assignment and survival (in d) conditional on tumor stage. The high- 
and low-risk stage I lung adenocarcinoma groups differ significantly from 
each other (P= 0.003), whereas low- and high-risk stage HI tumors do not. 
/, Relationship between tumor class identified by hierarchical clustering and 
patient survival. Survival for patients in Cluster 3 differed relative to the tu- 
mors in Cluster 2(P~ 0.037) and approached significance for Cluster 1 and 
2 combined (P- 0.06). g, Analysis of the Michigan- based risk index using 
top cross- validated survival genes identify a low- and high-risk group in an 
independent cohort of 84 Massachusetts- based lung adenocarcinomas that 
are significantly different (/>= 0.003). h, Among the 62 stage I lung adeno- 
carcinomas in the Massachusetts sample, the high- and low-risk groups dif- 
fered significantly (P = 0.006). 



After conservatively choosing the 60th percentile cutoff point 
from the training set, we then applied this risk index, and cutoff 
point to the testing set. The risk index of the top 50 genes cor- 
rectly identified low- and high-risk individuals within the inde- 
pendent testing set (F = 0.024) (Fig. 3b and Supplementary 
Methods online). Notably, 1 ] stage I tumors were included in 
the high-risk subgroup. When this risk assignment was then 
conditionally examined for stage progression (Fig. 3c), low- and 
high-risk groups among stage 1 tumors were found to differ (P = 
0.028) in their survival. 

Identification of a robust set of survival genes 
Although predictive of patient survival, a single training-testing 
set may not provide the most robust set of genes due to random 
sampling issues. Therefore, a *leave-one-oul' cross-validation ap- 
proach was used to identify genes associated with survival from 
all 86- tumor samples. We first developed a 50-gene risk index in 
each training set, and then applied the risk index to the test case 
held out from the hill set of tumors and assigned the held out 
tumor to the high or low-risk groups (Fig. 3d). The high and 
low-risk subgroups dele/mined in the test cases differed signifi- 
cantly in their overall survival (P = 0.0006). Among the larger 
group of stage I lung adenocarcinomas, the low-risk {n = 46) and 
high-risk [n = 2J) groups had markedly different survival {P = 
0.003) (Fig. 3e). Table 1 lists selected examples of the cumulative 
top 3 00 genes derived from this cross-validation procedure 
(complele list in Supplementary Table A online). 

It was also noted that many of the stage I patients in the high- 
risk subgroup (Fig. 3f) were present in Cluster 3 (Fig. 1). 
Kap J a n- Meier analysis (Fig. 3/) demonstrated a significantly 
worse survival [P = 0.037) lor patients in Cluster 3 relative to pa- 
tients in Cluster 2 and approaching significance for Cluster 1 
and 2 combined \P = 0.06). This further indicates the important 
relationship between gene- expression profiles and patient sur 
vival, independent of disease stage. 

Consistent with previous analyses of lung adenocarcinomas", 
40% of stage I and 57.8% ol stage HI tumors had ! 2th or )3th 
codon K-rus gene mutations. Those patients with tumois con- 
taining K-rr.s mutations showed a trend of poorer survival, but 
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Table 1 Selected examples of the top 100 genes from cross-validation 



Gene name P % P % Change in 

(normal versus \ Change in tumor (stage I versus stage HI 
tumor t-test) stage III Mest) 



CASP4 
P63 



KRT7 
LAMB1 

BMP2 
CDC6 

5100P 
SERPINE1 

STX1A 

ADM 

AKAP 12 

ARHE 

GRB7 

VECf 

WNT10B 



H5PA8 
ERBB2 
FXYD3 
SLC20A1 



C5TB 
a SI 
CYP24 

FUT3 

MLN64 
PDl 7A 
PtCL 
5K1A6 



COPIB 
CRK 



m la 



KIAA0005 
MGB1 



0.56 
9.73E-04 



8.02E-08 
0.14 

0.54 
1.31E-05 

2.10E-08 
2.89E-03 

8.65E-08 

0.05 
8:53E-03 

0.06 
2.02E-03 
6.50E-08 

0.05 



036 
0.04 
0.10 
1.34E-03 



1.57F-04 
0.48 . 
3.16E-06 

1.07F-07 

0.70 
0.1? 
0.04 
0.07 



0.10 
0.10 



?.?1l-0'1 

0.77 



-6% 
37% 



126% 
-20% 



-21% 
1070% 



1572% 
72% 



39% 
-47% 
-39% 

38% 
174% 

31%' 



8% 
92% 
111% 
58% 



50% 
-10% 
N/A 

114% 

32% 
33% 
-68% 
-32% 



-33% 
32% 



40% 
125% 



0.02 
0.03 



0.11 
0.01 

0.27 
0.05 

0.19 
0.25 

0.07 

0.04 

0.05 

0.05 . 

0.63 

0.02 

0.48 



9.01 E-04 
0.37 
0.31 
0.02 



0.15 
0.03 
0.97 

0.97 

0.42 
0.01 
0.35 
0.12 



0.26 
0.03 



0.02 
0.33 



57% 
43% 



55% 
60% 

47% 
148% 

77% 
30% 

26% 

117% 
214% 
87% 
15% 
85% 
20% 



51% 
120% 
73% 
66% 



34% 
67% 
2% 

-1% 

80% 
-35% 
-170% 

86% 



25% 
48% 



45% 
459% 



Coefficient Unigene comment 
f> 

Apoptosis related 

0.0022 Caspase 4, apoptosts- 

related cysteine protease 

0.0010 . Transmembrane protein (63 kD), 
endoplasmic reticulum/ 
Goigi intermediate compartment 

Cell adhesion and structure 
0.0003 Keratin 7 
0.0027 Laminin, pi 

Cell cycle and growth regulators 
0.0044 Bone morphogenetic protein 2 
0.0124 CDC6 (ceH division cycle 6, 

SocchpTomyces cerevhioe homolog) 
0.0001 SI 00 calcium* binding protein P 
0.0008 Serine (or cysteine) proteinase inhibitor, 

clade E{nexin). 
0.0031 Syntax in 1A (brain) 

Cell signaling 
0.0016 adienomedullin 

0.0010 A kinase (PRKA) anchor protein (gravin) 12 
0.0092 ras homolog gene family, member E 
0.0030 Growth factor receptor-bound protein 7 
0.001 3 Vascular endothelial growth factor 
. 0.0022 Wingless-type MMTV integration site family, 
member. 1 0B 

Cbaperones 
0.0008 Heat-shock 70 kD protein 8 

Receptors 

0.001 3 v-eib-b2 avian erythroblastic leukemia viral 

oncogene homolog 2 
0.0046 FXYD domain-containing ion transport 

regulator 3 

0.0021 Solute carrier family 20 (phosphate 

transporter), member 1 

Enzymes, cellular metabolism 
0.0001 Cystatin B (stefin B) 

0.0007 CathepsinL 

0.0008 Cytochrome P450, subfamily XXfV 

(vitamin D 24- hydroxylase) 
0.0033 Fucosyltiansferase 3 (galactoside 3(4)1- 

fucosyltransf erase, Lewis blood group included) 
0.0007 Steroidogenic acute tegulatory protein related 

-0.0187 Phosphodiesterase 7A 
-0.0011 Plasminogen -like 

0.0069 Solute carrier family 1 (high- all inity aspartate/ 
glutamate transporter), member 6 
Transcription and translation 
0.0016 Core promoter element binding protein 
0.0098 v-crk avian sarcoma vims O 10 oncogene 
homolog 

0.0034 v-rel avian relit uloendolhrliosis viral 
oncogene homolog A 
Unknown function 
0.0010 KIAA0005 gene produtt 

0.0018 Ma mm agio bin 1 



Boldedgrnei v.vtt alio lignilir an! lot .\uivival in 43 lumoi Iiaining lei {fig. 3b). 



Table 1 Selet red e>am|>les o* the cumulative top 100 genes identified using 
training- testing, i toss- validation of ail 86 lung tumor samples. The peitent 
change, as well as Ow 1 dirt c rion, for the average values of the 10 norv neoplastic 
lung to all tumors, and loi the 67 Jtage I to the 19 stage HI tumor s are shown. A 
positive coefficient (1 value is indicative of a relationship of gene expression to a 



poorer patient ouicoroc. The genes are listed in potential tunc tional categories. 
Genes that weie also present in the top 50 survival genes using the 43- tumor 
training set (Fig. 3b) are indicated in bold type. Complete lifting of the gene 
piobe sets and annotated gene and unigene identifier tan be found in the 
Supplementary Methods. 
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[ ig. 4 Gene expression patterns ot top survival gFnes u. Gene- expulsion patterns de- 
termined using agglomerative hieratchitat tlustering ol the 86 lung adenocarcinomas 
against the 100 survival-related genes (7 able 1) identified by the i raining testing, cross- 
validation analyst. Substantially elevated (red) or deneased (green) expression of the 
qenes is observed in individual tumors. 5ome tumors (blatV arrow and expanded area) 
show extremely elevated expression ot specific genes. />. An outlier gene expression pat- 
tern (>a times ihe interquartile range among all sampled is observed tor the nbB2 and 
KenlA genes (top Ictt and right, respectively). Ihe SI 001' and tfk genes (bottom lett and 
right, iespr-( tively) ihow a graded pattern ot t**press«on it-laled to patient survival. O, 
alive; dead (also in < ). «, "the number ot outliers pff prnon identified in the top 100 
nenrs plotted by survival distribution. 
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this difference did not ieach statistical significance among all 
patients (P= 0.25), between patients within tumor clusters (P = 
. 0.41) or when analyzed separately among stage 1 [P- 0.22) and 
stage }\\ {P- 0.53) patients. Nuclear accumulation of p53 was de- 
tected in 17.9% stage 1 and in 22.2% stage 111 tumors. No signifi- 
cant relationship was observed for p53 staining and patient 
survival, cluster or tumor stage. 

Confirmation using an independent set of adenocarcinomas 
The robustness of our 50-gene risk index in predicting survival in 
lung adenocarcinomas was tested using oligonucleotide gene-ex- 
pression data obtained from a completely independent 
(Massachusetts-based) sample of 84 lung adenocarcinomas (62 
stage I, 14 stage 11 and 8 stage 111; ref. 21, and dataset A at 
www.genome.wi.mit.edu/MPR/Iung). To ensure equivalent 
power for testing and comparability of samples, the criteria for 
including tumors in the analysis were 40% or greater tumor cellu- 
larity, no mixed histology (that is, adenosquamous) and patient 
survival information. To obtain comparative gene-expression 
measures between the two data sets, gene sequences present on 
the U95A and HuGeneFL array were examined, and expression 
data for our top 50 cross-validation genes for all 84 Massachusetts 
samples were obtained and processed 2 * (see also Supplementary 
Methods online ). When we examined the risk assignment of 
these 84 samples, employing the identical cutoff point used for 
the 86 Michigan- based Jung samples, we observed Jow- and high- 
risk groups (Fig. 3g; P= 0.003). Notably, among the 62 stage 1 tu- 
mors, high- and low-risk groups were observed that differed 
significantly (P - 0.006) in their survival (Fig. 3ft).. 

Survival genes had graded and outlier expression patterns 

A statistical and graphical analysis of the 100 survival-related 



genes (Table 1) clustered against all 86 tumors revealed individ- 
ual tumors with substantially elevated expression in both a lim- 
ited and larger number of genes (Fig. 4a). Among these genes, we 
observed two distinct patterns of expression related to patient 
survival. One pattern, designated 'outlier', included genes show- 
ing substantially elevated expression (greater than five times the 
interquartile range among all samples), whereas the. other pat- . 
tem, designated 'graded', was characterized by continuously dis- 
tributed expression with patient survival (Fig. 4b). The erbB2 and 
Reg} A genes are examples of outlier expression patterns and 
S100P and crJt genes of graded patterns. The number of outliers 
per person in the top 100 genes was identified and plotted ac- 
cording to survival times and events (Fig. 4c). Both stage I and 
stage Ml lung adenocarcinomas .showed outlier gene patterns 
and 10 tumors contained 3 or more outlier genes. 

Because gene amplification may result in increased gene ex- 
pression, the nine genes with outlier expression patterns {erbB2, 
SLCIA6, Wnt 1, MGB1, Reg} A, AKAPM, PACE, CYP24, KYNU) 
and one gene with a graded expression pattern (KK7T8) were ex- 
amined using quantitative genomic VCR to evaluate genomic 
copy number (Fig. So). Gene amplification of erbBZ (17ql 2) was 
detected in tumor L94, which had the highest erbEZ mRNA ex- 
pression (Fig. 4a). Gene amplification was not detected for any 
of the other seven tested genes in tumor L94, as well as in other 
tumors. The two genes most frequently demonstrating the out- 
lier pattern in these lung adenocarcinomas were KYNU and 
CYP24, and were present in. 10 and 9 tumors, respectively. 
CYP24 has been described as a gene amplified and over ex pressed 
in breast cancer", and these results indicate elevated expression 
in lung adenocarcinoma. 

To determine whether the graded or outlier gene-expiession 
patterns also occur at the piotein-expression level, 10 of the 100 
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lig. 5 Gene amplification and protein expression of survival-related genes, 
o. Analysis of potential gene amplification for 9 genes showing outlier expres- 
sion patterns in the lung tumors (?rb82, 3fCJA6, Wnt 1. MCfil, fleglA, 
AKAPM, PACL, CYP24 and KYNU) and examined using quantitative genomic 
PCR. A gene showing graded expression pattern (HRTiS), and one gene 
(PACE 4) with 3 similar chromosome location as PMl, weie used as c ontrols. 
Only frbB? and ReglA are shown. An esophageal adenocarcinoma with 
known high-level genomic amplific ation of citR? wai used as a positive con- 
hoi and normal e^phagus DNA used as ;> negative control (Cll). PC ft 
tragmrnU sires were 343 bp loi GAPDH, 166 bp tor r/bB? and 126 bp lot 



ffeglA. DNA is tiom normal lung (N) and tumor (T) from each patient {lor ex- 
ample 137). b, Immunohij toe hemic al analysts of survival related genes with 
lung adenocarcinoma mic roan ays using the tumors from this' study. "I he 
transmembrane erbB? protein (top left) e>pression is substant rally increased 
in tumor 194 containing the amplified rrbH2 gene (fig. 4o and b). Expression 
of VI CI (top right) and M00P (bottom left) was located within the neoplas- 
tic cells and the pattern ol irnmunoreactivity was consistent with the graded 
expression pattern demons! i at ed by their mRNA prolilej. Expression ol die 
oncogene ctk (bottom right) was abundantly expressed in neoplastic lung 
cells. Magnification, >400 (erbB?); >?00 (VI CI, 51 OOP and ok). 
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top survival genes (Table 1) for which specific antibodies were 
available weie chosen for immunohistochemical analysis using 
lung-tumor arrays from this study (Fig. Sb). Expression of mem- 
brane erbB2 protein was substantially increased in the er£>B2-am- 
plified tumor L94 and very low levels of expression were present 
in other tumors, consistent with mKNA expression measure- 
ments (Fig. 4a and b). CDC6 protein expression was also sub- 
stantially higher in tumor L94; consistent with rnRNA levels 
(data not shown). Expression of vascular endothelial growth fac- 
tor (VEGF) and SI OOP (Fig. Sb), as well as cy toker atin 3 8 (KRT1 8), 
* cy toker atin 7 (KRT7) and fas-associated death domain (FADD) 
protein (data not shown), was located within the lung tumor 
cells and consistent with the graded expression pattern of the 
rnRNA profiles. The oncogene ak showed both graded mRNA as 
well as a graded protein-expression pattern with survival, and 
was abundantly expressed in the tumor cells (Fig. Sb). These re- 
sults indicate that many survival-associated genes are expressed 
at the protein level and demonstrate similar mRNA and protein- 
expression patterns. 

Discussion 

We used several approaches for the analysis of gene-expression 
data related to clinicopathological variables and patient sur- 
vival. One approach, hierarchical clustering, was used to exam- 
ine similarities among lung adenocarcinomas in their patterns 
of gene expression. Previous studies of lung tumors 2122 have also 
used this method to describe subclasses. of lung tumors. Here, 
we found thjee clusters that showed significant differences with 
respect to tumor stage and tumor differentiation. This suggests, 
as expected, that tumors with similar histological features of 
differentiation demonstrate similarities in gene expression. 
This feature also partly underlies the observed statistical associ- 
ation of tumor stage and cluster, as many of the higher-stage tu- 
mors, often poorly differentiated and previously associated 
with a reduced survival* 10 , were located in Cluster 3. Although 
. this cluster, contained the highest percentage of stage 111 tu- 
mors, it also contained a nearly equal mixture of stage I and 
st3ge 111 tumors and not all tumors were poorly differentiated. 
This indicates that a subset of stage 1 lung adenocarcinomas 
share gene expression profiles with higher-stage tumo/s. 
Notably, 10 of the 1 1 stage I tumors found in Cluster 3 were the 
high-risk stage I tumors identified using the lisk index in the 
'leave-one-out' cross-validation. 

In contrast to previous analyses of lung adenocarcinomas 71 " 7 , 
we validated the expression datairom the arrays. The strong cor- 
relation of northern-blot analysis and oligonucleotide-array data 
for gene expression in the same samples (Fig. 2b) indicates that 
these studies provide robust gene-expression estimates. 
ImmunohiMochemistry using the same tumor samples in tissue 
auays demonstrates protein expression within the lung tumor 
cells. Together, these studies indicate that many of the genes 
identified using gene-expression profiles are likely relevant to 
lung adenocarcinoma. For example, )GFBP3 gene expression is 
incieased in lung adenocarcinomas (Fig. 2c). IGFBP3 protein 
modulates the autocrine or paiaciine effects of insulin-like 
growth factors, elevated IGFBP3 expression isobseived in colon 
cancer 6 , and incieased seium IGFBP3 is associated with progres- 
sion in breast cancer". Hent-shock protein 70 (MSP- 70) is in- 
creased in lung adenocarcinomas of smokers 76 and is associated 
with incieased metastatic potential in breast cancer". Incieased" 
seium lactate dehydrogenase is correlated with tumor stage and 
tumor bmden j: ', and cystatin C, a cysteine protease inhibitor ex- 



pressed in human lung cancers 31 , is prognostic in some cancers 32 . 
The decreased expression of this protease inhibitor may affect 
the invasive properties of the tumor cell. 

The cross-validation analytical strategy we used is particularly 
informative for these types of gene- expression analyses for dis- 
ease outcome 33,34 , and identification of cross- validated genes with 
a larger tumor cohort may help refine this risk index for use in a 
clinical setting. The gene- express! on data also provide opportune 
ties to observe overarching patterns that advance our under- 
standing of associations between genes and disease. For example, 
the top 100 survival genes include those involved in signaling, 
cell cycle and growth,, transcription, translation and metabolism. 
Expression of many of these genes is likely a function of increased 
proliferation and metabolism in the more aggressive tumors. 
Some genes, such as erbB2 and Reg} A (Fig. 4a and b), were highly 
over expressed in a few patients having poor survival. In one 
tumor, the erbB2 gene was amplified (Fig. So), demons! rating that 
genomic changes may underlie the over expression of a subset of 
these outlier genes. Immun ©histochemistry confirmed protein 
over expression in this patient's tumor (Fig. Sb). Notably, seven of 
the eight outlier genes were not amplified, indicating that other 
mechanisms underlie the increased mRNA expression of these 
survival-related genes. 

Most genes showed a graded relationship betiveen expression 
and patient survival. Genes such as that encoding VEGF, known 
to be strongly associated with survival in lung cancer"- 3 * were 
identified as related to patient survival in our study. VEGF 
demonstrated a graded expression pattern, as did the S100P and 
ak oncogene (Fig. Sb). S100P is a calcium-regulated protein not 
previously reported in lung cancer. The crk gene, the cellular ho- 
molog of the v- ak oncogene, is a member of a family of adaptor 
proteins involved in signal transduction and interacts directly 
with c-jun N-terminal kinase 1 tJNKl) 37 . Although ak has not 
been shown to have a role lung cancer, its role in the MAP-ki- 
nase pathway, which leads to activation of matrix metallopro- 
teinase secretion and cell invasion 38 , indicates potential 
involvement in the the tumor cell invasion or metastasis of 
some lung adenocarcinomas. Among the many genes identified 
in this study, like crk, that may be causally involved in lung can- 
cer progression (Table 1), some were related to sutyivaJ in many 
patients, and others in only smaller subsets of patients. This re- 
sult is consistent with the complex molecular architecture ol tu- 
mors in general, the heterogeneity of lung adenocarcinomas in 
particular and the multiple mechanisms underlying tumor-cell 
survival, invasion and metastasis 39 . 

Our results demonstrate that a gene-expression risk profile — 
based on the genes most associated with patient survival— can 
distinguish stage I lung adenocarcinomas and differentiate prog- 
noses. The particular genes that deline the clusters, or are associ- 
ated with survival, likely reflect the characteristics of the 
particular tumors included in the analysis. Current therapy for 
patients with stage I disease usually consists of surgical resection 
without adjuvant treatment 23 . Clearly, the identification of a 
high-risk group among patients with stage I disease would lead 
to consideration of additional therapeutic intervention lor this 
group, possibly leading to improved survival ol these patients. 

Methods 

Patient population. Sequential patients seen at the Univnitty ol Michigan 
Hojpital between May 1994 and futy ?000 lor stage I or stage HI lung ade 
no( an inoma were evaluated lor this .\ludy. Consent was leceived and the 
project was approved by the total Institutional Review Board. Primary tu- 
moi> and adjacent non neoplastic lung tissue were obtained at the time ot 
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surgery. Peripheral portions of resected Itmg carcinomas were sectioned, 
evaluated by a stody pathologist and compared with routine H&E sections 
of the same tumors, and utilized for mRNA isolation. Regions chosen for 
analysis contained a tumor. cellularity greater than 70%, no mixed histol- 
ogy, potential metastatic origin, extensive lymphocytic infiltration or fibro- 
sis. Tumors were histopathologicalry divided into two categories based on 
their growth pattern: bronchial-derived, if they exhibited invasive features 
with architectural destruction, and bronchioloalveplar, if they exhibited 
preservation of the lung architecture. Alt stage I patients received only sur- 
gical resection with intra- thoracic nodal sampling and no other treatments. 
Mage. HI patients received surgical resection plus chemotherapy and radio- 
therapy. 

Cene- expression profiling and K-/os mutation analysis. RNA isolation, 
cRNA synthesis and gene-expression profiling were performed as de- 
scribed 1 *. Details of gene annotation and K-ros mutation analysis are pro- 
vided in supplementary information. 

Northern-blot analysis. Total cellular RNA (10 pg) was separated in 1.2% 
agarose-formaldehyde gels and vacuum-transferred to Cene Screen Plus 
(NEN Life Science Products, Boston, Massachusetts). Hybridization condi- 
tions and probe labeling were as described**. Individual sequence-validated 
cDNA image clones for human ICFBP3 (clone 1407750), tDH A (clone 
2420241), cystatin C (CTS3; clone 949938) were from Research Cenetks 
(HuntsviHe, Alabama). The human histone H4 cDNA and the 28$ ribosomal 
RNA 26-mer oligonucleotide probe were prepared and labeled as de- 
scribed 40 . 

Cene- amplification analysis. 1 1 genes were selected for the analysis of ge- 
nomic alterations. Primers were designed using PrimerSelect 4.05 Windows 
32 software (DNA5TAR, Madison, Wisconsin), avoiding pseudogenes or po- 
tential homologous regions. Forward and reverse primers for the genes are 
provided (Supplementary Methods online). Quantitative genomk-PCR was 
then applied and analyzed as described 0 . 

Immunohistochemical staining. The HfaE-stained slides ot alt primary 
lung tumors were used to identify the most representative regions of each 
tumor and a tissue microarray (TMA) block was constructed as described 0 . 
Immunohistocbemistry (IHC) was performed using both routine and sec- 
tions from the TMA block as described 7 *. Detailed methods and the con- 
centrations used lor all antibodies are provided in the Supplementary 
Methods. 

Statistical methods. Mests were used to identify differences in mean gene- 
expression levels between comparison groups. Agglomeratrve hierarchical 
clustering*' was applied using the average linkage method to investigate 
whether there was evidence for natural groupings ol tumor samples based 
on correlations between gene-expression profiles. To investigate the ro- 
bustness of the cfustering inference, gene-e>pression values were per- 
turbed by adding random Gaussian error of magnitude obtained from a 
duplicate sample to each data point and then reclustered to determine con- 
cordance in the tumor's class membership. Pearson, ann< I'sher's e*act 
tests were used to assess whether cluster membership was associated with 
physical and genetic characteristics ol the tumors. 

To determine whether gene-expression profiles were associated with 
variability in survival times, 2 separate but complementary approaches 
were used. In the first approach, the 86 tumors were randomly assigned to 
equivalent training and testing sets consisting c>t equal numbers ol stage I 
and lit tumors in order to validate a novel risk-index (unction that captured 
the effect ol many genes at once, In the second approac h, c ross- validation" 
was used to more robustly identify the genes a>ioi rated with survival. 
Briefly, a 'leave-one-out' cross-validation procedure in whk h 85 ot the 86 
tumors (the training set) was used to identify qrnes that were univanately 
assoc rated with survival. The risk index was delincd as a linear combination 
ot the gene-expression values for the top genes identified by univariate Cox 
proportional- hazard regression modeling* 5 , weighted by their estimated re- 
gression coetttt rents. Kaplan- Meier survival plots and Jon- rank tests were 
then used to assess whether the risk- index assignment to high/low cate- 
gories was validated in the test set. A more (retailed description is provided 
(Supplementary Methods online). 



Note: Supplementary information is ovoilobfe on the Nature Medicine website. 



Acknowledgments 

We thonk D. Sanders for technical assistance; D. Sing for assistance with the 
figures; ond C. Omenn for aitkol reading of this manuscript This work was 
supported by Notional Cancer Institute grant: U19 CA-8S9S3 ond the Tissue 
Core of the University of Michigan Comprehensive Concer Center (NIH CA- 
46952). 

Competing interests statement 

The outhors declore that they hove no competing financiol interests. 
RECEIVED 5 APRIL; ACCEPTED 14 JUNE 2002 

1. Fry, WA, Phillips, |.L. & Menck, H.R. Ten-year survey of lung cancer treatments 
and survival in hospitals in the United States. Concer 86, 1867-1 876 (1999). 

2. Williams, D.E. et ol Survival of patients surgically treated for stage I lung cancer! /. 
Thoroc. Cordibvosc. Surg. 82, 70-76 (1981). 

3. Patrolero, P.C. et ci. Postsurgical stage I bronchogenic carcinoma: Morbid implica- 
tions ot recurrent disease. >4nn. Thorac. Surg. 38, 331-338 (1984). 

4. Naruke, T. et ot. Prognosis and survival rn resected carcinoma based on the new in- 
ternational staging system. /. Ihoroc. Cordiovosc. Surg. 96, 440—447 (1988). 

5. Kaisermann, M.C. et ol. Evolving features of (ung adenocarcinoma m Rio de Janeiro, 
Brazil. Oncol. Rep. 8, 189-192 (2001). 

6. Rogglt, V.L. et ol. Lung cancer heterogeneity: A blinded and randomized study of 
100 consecutive cases. Hum. Pothot. 16,569-579 (1985). 

7. Gail, M.H. et ol. Prognostic factors in patients with resected stage 1 norv small celt 
lung cancer: A report horn the lung Cancer Study Croup. Concer 54, 1802-1813 
(1984). 

8. TaVise, A. ef ot. Histopathologic prognostic factors in adenocarcinomas of the pe- 
ripheral lung less than 2 cm in diameter. Concer 61, 2083-2088 (1988). 

9. khinose, Y. et ol. Is T factor of the 1MN staging system a predominant prognostic 
factor in pathologic stage I non- small cell lung cancer, f. Ihoroc. Cordtovon. Surg. 
106,90-94(1993)., 

10. Harpole, D.M. et ot. A prognostic model of recurrence and death in stage t non- 
small celt lung cancer utilizing presentation, hislopathology, and oncoprotein ex- 
pression. Concer ffrs. 55, 51-56 (1995). 

11. Rodenhuis, S. et oi Mutational activation of the K-/c» oncogene: A possible patho- 
genic factor in adenocarcinoma of \he lung. N. tngl. }. Med. 317, 929-935 (1987)! 

12. Sleboj, R.JX. er ol. K-ros oncogene activation as a prognostic marker in adenocarci- 
noma of the rung. N. tngl. f. Med. 373, 561-565 (1990). 

13. Horio, Y. ef ot. Prognostic significance of p53 mutations and 3p deletions in primary 
resected norv small cell lung cancer. Canter ftes. 53, 1—4 (1 993). 

14. Kern, I.A. er ol. GerfcB-2 opression and codon 12 K-ros mutations both predict 
shortened survival tor patients with pulmonary adenocarcinomas./. On. Jnvpsf. 93, 
516-520(1994). 

15. ibina, M.'ef ot. Relationship ol p53 overoptesson and op-regulation of proliferat- 
ing cell nuclear antigen with the clinical course of non-small celt rung cancer. 
Cower flrs. 54, 2496-2503 (1994). 

16. Mehdi, 5_A. et ot. Pi ognostic mar Vers in resected stage I and H non- small cell lung 
cancer: an analysis of 260 patients with 5 year foflow-up. Clin, lung Comer 1, 
59-67(1997). 

17. Schneider, P.M. er ot. Multiple molecular marker testing (p53, C-Ki- toy, c-erbB-2) 
improves estimation ot prognosis in potentially curative resected non- small cell 
lung cancer. Bf. \. Contu B3, 473-47? (2000). 

18. Merbst, R.5. et ot. Differential e>press»on ol E-cadhetin and type IV collagenase 
genes pr edit Is outcome in patients with stage I non- small cell lung carcinoma. Ci7n. 
Con. fles. 6, ? 90-797 (7000). 

19. liotta, I. b Petmion, t. Molecular profiling ol human cancer. Nature Rev. Cenet. 1, 
48-56 (7000). 

70. Golub, l.R. tditoiral: Genome- wide views of cane er. N. tngl. /. Med. 344, 601-607 
(2001). 

21. Bhatlacharjre, A. ft ot. Classification of human lung carcinomas by mRNA expres- 
sion profiling reveals diilim r adenocarcinoma subclasses. Pro(. Notl. Mod. Sri. USA 
98, 13790-13795(7001). 

22. Garber, M.F.. ft ot. Diversity of [jent expression in adenocarcinoma ot the lung. 
Piot. Noli Atod. Sd 1/5.4 98, 13784-13789 (7001). 

73. Mills, N.L el of. Increased prevalenc e of K-;os oncogene mutations in lung adeno- 
carcinoma. Com ft Kei. 55. 1444- 1447 (1995). 

24. Giordano l.|_ tt ot. Organ-spec ific rnolec ufar classification of lung, colon and ovar- 
ian adenocarcinomas using yene expression profiles. Am. J. Pothol. 159. 
1231-1738 17001). 

25. Albeitson, D.G. rl ot. Quantitative mapping ol arnplicon structure by array CGH 
identities CYP74 as a < andidate oncogene. Notutr Cenet. 75, 144-146(7000). 

26. Kansra, 5. et ol. tGIBP-3 mediates TGI {11 ptoliler alive response in colon cancer 
cells. Int.). Comet 87. 373-378 (7000). 

77. Vadgama |.V, rt ol. Plasma inmlin. til e riiov.ih lac lorl and serum 1GI- binding pro- 
tein 3 can be assoc rated with the p'oore\sion ol breast canter, and p» edict the nil 
ot inuiitfiif and the probata lily ol survival in Ahican-Amrritan and Hispanic 



NAUJRl MIDKIMt * VOIUMI 8 - NUMh t F* P - AUf.UM 700.? 



823 



women. Ontology 57, 330-340(1999). 

28. Vblm, M.,- Mat tern, J. fx Stammler, C. Up-regulation of heal shock protein 70 in 
adenocarcinoma ol the lung in smokers. Anticancer Re*. 15, 2607-2609 (1 995). 

29. Ciocca, D.R. rt al. Heat shock protein hsp70 in patients with auxiliary lymph node- 
positive breast cancer prognostic implications, f. Nott. Cancer, frisf. 85; 570-574 
(W3). 

30. Rotenberg, Z. rt of. Total lactate dehydrogenase and its isoenzymes in serum of pa- 
tients with non-small ceD lung cancer. Clin. Chem. 34, 668-670 (1988). 

31. Krepela, £. rt of. Cysteine proteases and cysteine protease inhibitors in non-smalt 
cell rung cancer. Neoplasmo 45, 31 8-331 (1998). 

32. Kos, J. rt of. Cysteine proteinases and their inhibitors in extracellular fluids: Markers 
for diagnosis and prognosis in cancer, int. f. Bhl. Marker* 15, 84-89 (2000). 

33. Cohrb, T.R. rt of. Molecular classification of cane en Class discovery and class pre- 
diction by gene expression monitoring. Science 286, 531-537 (1999). 

' 34. Hedenfalk, I. et of Cene- expression profiles in hereditary breast cancer. N. IngL f. 
Med. 344, 539-548 (2001). 

35. Ohta, Y. el ol. Vascular endothelial growth factor and rymph node metastasis in pri- 
mary lung cancer, fir. /. Confer. 76, 1 041-1045 (1 997). 

36. Shibusa, T., Shijubo, N. & Abe, S. Tumor angiogenesis and vascular endothelial 
growth factor expression in stage I lung adenocarcinoma. Cftru Cancer Res. 4, 



1483-1487(1998). 

37. Cirardin, S.E. & Yaniv, M. A direct interaction between JNK1 and Crkll is critical for 
Racl-induced jNK activation. tMBOf. 20, 3437-3446(2001). 

38. Liu, E. rt at. The Has- mitogen- activated protein kinase pathway is critical for the ac- 
tivation of matrix metalloprotetnase secretion and the invasiveness in v-crk- trans- 
formed 3Y1 . Confer lies. 60, 2361-64 (2000). 

39. Hanahan, D. 8 Weinberg, R.A. The haflmarks.of cancer. CeQ 100, 57-70 (2000). 

40. Hanson, LA. rt of. Expression of the glucocorticoid receptor and K-rcu genes in ure- 
than- induced mouse rung tumors and transformed cell lines. Exp. Long. Hey 17, 
371-387(1991); 

41. Lin, L. et al. A minimal critical region of the 8p22-23 amplicon in esophageal ade- 
nocarcinomas defined using 5T5-ampWication mapping and quantitative PCR. in- 
cludes the GATA-4 gene. Confer Res. 60, 1341-1347 (2000). 

42. Kononen, ). et ah Tissue mic roar rays for high throughput molecular profiling of 
tumor specimens. Nature Med. 4, 844-B47 (1998). 

43. Johnson, R. H Wiener n, D.W. Applied MuliWonote Statistical Anotyih. 543-578 
(Prentice Mall, New Jersey, 1988). 

44. Stone, M. Asymplomics lor and against cross-validation. Biometriko 64, 29-38 
(1977). 

45. Cox, D.R. Regression models and life tables. J.R. Stat. Soc, 34, 187-220 (1972). 



8?4 



MATURI MIDKINI * VOlUMIfl ■ NUMFIRe 



- MKA'1'1 7007 



21 5: J Immunol M ethods. 2000 Dec 1 ;246(1 -2):79-90. Related Artides, Links 

jELSEVI 5 K| 

Rapid quantitation of proinflammatory and chcmoattractant cytokine 
expression In small tissue samples and uionocyte-derived dendritic cells: 
validation of a new real-time RT-PCR technology. 

Blaschke V , Reich K. Blasclike S , Zipprich S , Neumann C 

Department of Dermatology, von-Siebold-Str. 3, D-37075, Goettingen, Germany. 
vblasch@gwd&de 

The analysis of cytokine profiles plays a central part in the characterization of disease- 
related inflammatory pathways and the identification of functional properties of immune 
cell subpopulations. Because tissue biopsy samples are too small to allow the detection of 
cytokine protein, the detection of rnRNA by RT-PCR analysis is often used to investigate 
the cytokine milieu in inflammatory lesions. RT-PCR itself is a qualitative method, 
indicating the presence or absence of specific transcripts. With the use of internal or 
external standards it may also serve as a quantitative method. The most widely accepted 
method is quantitative competitive RT-PCR, based on internal shortened standards. 
Recently, online real-time PGR has been introduced (LightCycler), which allows 
quantitation in less than 30 min. Here, we have tested its use for the analysis of cytokine . 
gene expression in different experimental in vitro and ex vivo settings. First, we 
compared quantitative competitive RT-PCR with real-time RT-PCR in the quantitation of 
transcription levels of the CD4(+) cell-specific chemoattractant Interleukin-16 during the 
maturation of monocyte-derived dendritic cells, and found a good correlation between 
both methods. Second, differences in the amounts of IL-16 mRNA in synovial tissue 
from patients with rheumatoid arthritis and osteoarthritis as assessed by real-time RT- 
PCR paralleled differences in the level of IL-16 protein in the synovial fluid. Finally, we 
employed real-time RT-PCR to study the cutaneous expression of several cytokines 
during experimental immunomodulatory therapy of psoriasis by Interleukin-10, and 
demonstrate that the technique is suitable for pharmacogenomic monitoring. In summary, 
real-time RT-PCR is a sensitive and rapid tool for quantifying mRNA expression even 
with small quantities of tissue. The results obtained do not differ from those generated hy . 
quantitative competitive RT-PCR. 

Publication Types: 

• Evaluation Studies 
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Butyrate-induced reversal of dexametkasoue resistance in autonomous rat 
Nb2 lymphoma cells. 

Buckley AR, Krunienacker JS Buckley DJ . Lcff MA , Magnuson NS . Rccd JC , 
Miyashita T . de Jong G , Gout PVV . 

Department of Pharmacology and Toxicology, University of North Dakota School of 
Medicine and Health Sciences, Grand Forks 58202-9037, USA. 
abuckIey@mail.med.und.nodak.edu 

The parental rat Nb2 lymphoma is a prolactin (PRL)-dependent T cell line. Exposure of a 
PRL-independent subline, Nb2-SFJCD1 , to sodium butyrate (NaBT) causes transient 
reversal of their growth factor-independent proliferation in association with constitutive 
expression of protooncogenes pim-1 and c-myc. In the present study, we investigated the 
effect of NaBT treatment on the sensitivity of Nb2-SFJCD1 cells to dexamethasone 
(DEX)-induced apoptosis. Pretreatment with NaBT (2 rnM, 72 h) partially reversed 
resistance to apoptosis in Nb2-SFJCD1 cells exposed to DEX (100 nM) for 12 h, 
assessed by flow cytometric analyses of DNA fragmentation. However, the cytolytic 
effect b f DEX was abrogated by PRL in a time- and concentration-dependent manner. 
Evaluation of apoptosis -associated gene expression in NaBT-pre-treated cultures 
incubated with DEX or DEX+PRL indicated that the apoptosis resistance did not stem 
from altered bcl-2 or bax expression. However, there was a strong correlation between 
• the resistance to DEX-activated apoptosis and their enhanced expression of pim-1 mRNA 
and protein. The results show that it is possible to reverse DEX-induced apoptosis of Nb2 
pre-T cells and suggest the pim-1 gene product has an important role as a suppressor of 
this process, perhaps functioning as a mediator of PRL action. 
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Alterations in neuropeptide Y levels and Yl bindiug sites in the Flinders 
Sensitive Line rats, a genetic animal model of depression. 

Cabei lotto L , Jimenez P. Overstreet PR Kurd YL , Mathe AA , F uxe K. 

Department of Neuroscience, Karolinska Institute, Stockholm, Sweden. 

Previously, we observed specific alterations of neuropeptide Y (NPY) and Yl receptor 
mRNA expression in discrete regions of the Flinders Sensitive Line rats (FSL), an animal 
model of depression. In order to clarify the correlation between mRNA expression and 
protein content, radioimmunoassay and receptor autoradiography were currently 
performed. In the FSL rats, NPY-Hke immunoreactivity (NPY-LI) was decreased in the 
. luppocampal CA region, while Yl binding sites were increased; NPY-LI was increased 
in the arcuate nucleus. Fluoxetine treatment elevated NPY-LI in the arcuate and anterior 
cingulate cortex and increased Yl binding sites in the medial amygdala and occipital 
cortex in both strains. No differences were found regarding the Y2 binding sites. The 
results demonstrate a good correlation between NPY peptide and mRNA expression, and 
sustain the possible involvement of NPY and Yl receptors in depression. 
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Neurokinin 1 receptor and relative abundance of the short and long 
jsoforrns in the human brain. 

CabcrloHo L , Hurd YL , IVfurdock P , WatiHn JT, Mclotto S , Gorsi M , CaricUi R . 

Department of Biology, Psychiatry CEDD, GlaxoSmithKline Medicine Research Centre, 
Verona, Italy, Laura.L.Caberlotto@gsk.com 

Substance P exerts its various biochemical effects mainly via interactions through 
neurokinin- 1 receptors (NKl). Recently, theNKl receptor has attracted considerable 
interest for its possible role in a variety of psychiatric disorders including depression and 
anxiety. However, little is known regarding the anatomical distribution of NK1 in the 
human central nervous system (CNS). Riboprobe in situ hybridization, quantitative PCR 
and in vitro autoradiography were performed. Highest NK1 mRNA levels were localized 
in the locus coeruleus and ventral striatum, while moderate hybridization signals were 
observed in the cerebral cortex (most abundant in the visual cortex), hippocampus and 
different amygdaloid nuclei. Very low levels of the NK1 mRNA were detected in the 
cerebellum and thalamus. In view of the existence of a long and short isoform of the NK1 
receptor, it was of interest to assess whether there was a differentialdistribution of the 
two splice variants in the human CMS and peripheral tissues. A quantitative TaqMan 
PCR analysis showed that the long NK1 isoform was the most prevalent throughout the 
human brain, while in peripheral tissues the truncated form was the most represented. 
3H-Substance P. autoradiography revealed a good correlation between receptor binding 
sites and NFCl mRNA expression throughout the brain, with the highest levels of binding 
in the locus coeruleus. These results provide the anatomical evidence that the NK.I 
receptors have a strong association with neuronal systems relevant to mood regulation 
and stress in the human brain, but do not suggest a region-specific role of the two 
isoforms in the CNS. 
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Characterization of cyclin D2 expression in human endometrium. 
ChmD, VoonS, Lcc E , Hwang S , SoagS, tvim J , Voon BtC LecJH , 

. Department of Obstetrics and Gynecology, Samsung Medical Center, Sungkyunkwan 
University School of Medicine, Seoul, South Korea, dschoi@smc.samsung.co.kr 

OBJECTIVE: This study was undertaken to investigate cyclin D2 mRNA and protein 
expression in human endometrium during the menstrual cycle. METHODS: Endometrial 
samples were obtained from 15 premenopausal nonpregnant women who had 
hysterectomies for benign gynecologic reasons. They were divided into the following five 
groups according to histologic dating: early proliferative (n = 3), mid to late proliferative 
(n = 3), early secretory (n - 3), mid secretory (n - 3), and late secretory (n - 3). Cyclin 
D2 rnJINA and protein expression were analyzed using reverse transcriptase-polymerase 
chain reaction, Western blotting, and immunohistochemistry. RESULTS: Cyclin D2 
mRNA and protein were expressed in human endometrial tissue throughout the menstrual 
cycle. Gyclin D2 mRNA and protein expression of proliferative phase endometrium were 
significantly higher than those of secretory phase endometrium (P <.05). The staining 
intensity of cyclin D2 in proliferative phase endometrium was higher than that in 
secretory phase (P <.05). Cyclin D2 mRNA level showed good correlation with cyclin 
D2 protein level (R = 0.579, P <.03), and cyclin D2 protein also showed good correlation 
with immunohistochemical staining intensity (R = 0.562, P < 03). CONCLUSION: 
Cyclin D2 was expressed in human endometrium tliroughout the menstrual cycle. Cyclin 
D2 mRNA and protein were expressed at high levels in proliferative phase endometrium, 
especially in the early proliferative phase, and then decreased in die secretory phase. 
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Human chorionic gonadotropin beta expression in malignant Barrett's 
oesophagus. 

Couvclard A , Paraf F , Vidaud D , Dubois S , Vtdaud M Flejou JF . Dcgolt C . 

Service d 1 Anatomic Pathologique, Hopital Beaujon, 92 II 8 Clichy cedex, France. 
aiuie.couvelaid@bjn.ap-hop-paris.fr 

BACKGROUND: Human chorionic gonadotrophin beta (hCGbeta) is expressed in 
several non-trophoblastic tumours, and this is usually associated with aggressive 
behaviour. Little is known about hCGbeta expression in Barrett's adenocarcinoma. 
MATERIALS AND METHODS: We determined the hCGbeta profile in a large series of 
surgically resected Barrett's adenocarcinoma (a) at mRNA level using real-time 
quantitative reverse-transcription polymerase chain reaction analysis and (b) at protein 
level using unmunohistochemistry with a polyclonal antibody and with a monoclonal 
antibody specific for free hCGbeta. We then sought' links between the hCGbeta protein 
expression pattern and clinical and pathological parameters, including patient outcome as 
well as vascular endothelial growth factor (VEGF) expression. RESULTS: hCGbeta 
protein expression was observed in 43 of 76 (57%) Barrett's adenocarcinomas. We 
showed a strong correlation between hCGbeta protein abundance and CGB mRNA level. 
We observed a statistical link between hCGbeta protein expression and infiltrative 
tumour type ( P=0.023), perineural neoplastic invasion ( P-0.007) and VEGF protein 
expression ( P=0.016). hCGbeta expression tended to be associated with a poor outcome 
(16% versus 36% survival 8 years after resection); CONCLUSION: Expression of 
hCGbeta correlates with specific infiltrative characteristics and is associated with higher 
VEGF expression. Both molecules may play a co-ordinated role in the development of 
Barrett's adenocarcinomas. 
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Dowuregulalion of ENaC activity arid expression by TNF-alpha in alveolar 
epithelial cells. 

Dagenais A, Frechette R . Yamagata Y . Yamagata T . Carmel JF , Clermont ME , 
Brochiero E . Masse C . Berthtamue Y . 

Ceatre de recherche, CHUM-Hotel-Dieu, 3850 St-Urbain, Montreal, Quebec, Canada 
H2WlT7.andre.dagenaisxhum@ssss.gouv.qc.ca 

■Sodium absorption by an amiloride-sensitive channel is the main driving force of lung 
liquid clearance at birth and lung edema clearance in adulthood. In this study, we tested 
whether tumor necrosis factor-alpha (TNF-alpha), a proinflammatory cytokine involved 
in several lung pathologies, could modulate sodium absorption in cultured alveolar 
epithelial cells. We found that TNF-alpha decreased the expression of the alpha-, beta-, 
and gamma-subunits of epithelial sodium channel (ENaC) mRNA'to 36, 43, and 16% of 
the controls after 24-h treatment and reduced to 50% the amount of alpha-ENaC protein 
in these celts. There was no impact, however, on alpha(l) and beta(l) Na(+)-K(+)- 
ATPase rnRNA expression. Amiloride-sensitive current and ouabain-sensitive Rb(+) 
uptake were reduced, respectively, to 28 and 39% of the controls. A strong correlation 
was found at different TNF-alpha concentrations between the decrease of amiloride- 
sensitive current and alpha-ENaC rnRNA expression. All- these data show that TNF- 
alpha, a proinflammatory cytokine present during lung infection, has a profound 
influence on the capacity of alveolar epithelial cells to transport sodium. 
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Involvement of the CCND1 gene iu hairy cell leukemia. 

dc Hoct CJ , KJuiu-Nelcmans JC . Dreef E , Kcstcr MG , Khun PIVI, Sclnmring.E, van 
Kriekea JH . 

Department of Pathology, University of Leiden, The Netherlands. 

BACKGROUND: Previous results suggested increased mRNA expression of CCNDt in 
hairy cell leukemia (HCL). The CCNDl gene is involved in the t(l I;l4)(ql3;q32) 
chromosomal rearrangement, a characteristic abnormality in mantle cell lymphoma 
(MCL). We and others reported that, in contrast to other B-cell lymphomas, almost all 
MCL have over-expression of the CCNDl gene with a good correlation between RNA 
and protein analysis. Recent studies showed that overexpression of the cyclin Dl protein 
can be easily detected by immunohistochemistry (IHC) on formalin-fixed, paraffin 
embedded tissues. PATIENTS AND METHODS: To investigate whether the CCNDl 
gene is involved in HCL, we performed IHC on a series of 22 cases using formalin-fixed 
paraffin embedded splenectomy specimens. For IHC the sections were boiled in citrate 
buffer. The presence of rearrangements within the BCL-l locus and the CCNDl gene 
was analyzed in 13 of 22 cases by Southern blot analysis using all available break-point 
probes. Expression of CCNDl was analyzed at the mRNA level (Northern blot) and 
protein level (IHC). RESULTS: Overexpression of the cyclin Dl protein using IHC was 
observed in all cases, with strong expression in 5 cases. Pre-existing B- and T-cell areas 
of the spleen did not express significant levels of the cyclin Dl protein. Seven of 9 cases 
analyzed by both IHC and Northern blotting showed overexpression of the CCNDl gene 
with both methods. No genomic abnormalities were observed in any of the 13 cases 
studied by Soudiern blot analysis. Additionally, no I lql3 abnormalities were detected by 
banding analysis of 19 of 22 cases. CONCLUSIONS: The elevated levels of CCNDl 
mRNA and protein in conjunction with the absence of overt rearrangements within the 
BCL-l locus distinguish HCL from MCL and other B-cell malignancies. This suggests 
that activation of the CCNDl gene in HCL is due to mechanisms other than chromosomal 
rearrangement. 
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lahibiii and activiu production aud subunit expression in human placeulal 
cells cultured in vitro. 

Dehicvc F ; Pampfcr S , Thomas K . 

Department of Obstetrics and Gynecological Endocrinology, Universite Catholiquede 
Louvain, 1200 Brussels, Belgium. 

Inhibins and activins are dimeric proteins, with each subunit being one of three related 
protein subunits (alpha, betaA or betaB). The inRNA levels of these subunits were 
studied quantitatively during in-vitro differentiation of human cytotrophoblast cells into 
syncytium, using Northern blot analysis and semi-quantitative reverse transcription- 
polymerase chain reaction (RT-PCR) analysis. The corresponding protein concentrations 
were determined by specific enzyme-linked hnmunosorbent assays for inhibin A, B, pro 
alphaC and activin A in cellular protein extracts and culture medium (n = 5). 
Immunofluorescence studies showed syncytium formation after 48 h. The alpha subunit 
was present before plating and increased at 48 h (PO.001) while the betaA subunit was 
weak before plating and increased at 24 h. The betaB subunit was not detected. With 
respect to corresponding protein synthesis, inhibin A (alpha + betaA) had risen after 48 h 
in cellular protein extract and after 72 h in culture medium, while activin A (betaA + 
betaB) was detected after 24 h, widi no significant variations in culture medium. There 
was a good correlation between inhibin A and alpha subunit expression (r = 0.736, 
P<0.00 1), as well as between activin A and betaA subunit expression (r = 0.755, 
PO.001). This study showed that mRNA expression parallels protein synthesis of inhibin 
and activin in trophoblast cells. Inhibin A synthesis appears to be dependent on alpha 
subunit mRNA expression, rather than on the betaA subunit which controls activin A 
synthesis. This study has also shown that isolated cytotrophoblast cells do not produce 
dimeric inhibin. However, during the transformation of cytotrophoblast cells into 
syncytium, betaA subunit mRNA expression may be an indicator of cell aggregation, 
while alpha subunit mRNA expression may be an indicator of cell fusion. 
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Expression of membrane-type matrix metalloproteinases 4, 5, and 6 in 
mouse corneas infected with P. aeruginosa. 

Dong Z , KatarM , Alousi S , Berk RS . 

Department of Immunology and Microbiology, Wayne State University School of 
Medicine, 540 E. Canfield, Detroit, MI 48201, USA. 

PURPOSE: To investigate the expression and regulation of membrane-type matrix 
metalloproteinases (MT-MMPs) 4, 5, and 6 in the mouse corneas infected with 
Pseudomonas aeruginosa. METHODS: C57BL/6J mice were intracorneally infected with 
P. aeruginosa. The expression of MT4-, MT5-, and MT6-MMP was detected at both the 
rnRNA and protein levels by RT-PCR and immunoblot analysis. Immunohistochemical 
staining was performed to .localize the expression of MT4- and MT5-MMP in the mouse 
corneas, RESULTS: Expression of MT4- and MT5-MMP was detected in the normal 
(uninfected) cornea by RT-PCR and immunoblot analysis. When infected with P. 
aeruginosa, the corneas showed significant induction of each MT-MMP. Localization of 
MT4- and MT5-MMP revealed that the expression of MT5-MMP was restricted to the 
epithelial tissue in the normal cornea, whereas the induced expression of MT4- and MT5 
MMP was predominantly in the substantia propria, which contained most of the 
infiltrating cells. MT6-MMP expression was not detected in the uninfected cornea but 
was upregulated in the'infected corneas. CONCLUSIONS: Expression of MT4-, MT5-, 
and MT6-MMP was induced in corneas infected with P. aeruginosa. 
Immunohistochemistry showed predominant immunoreactivity of MT4- and MT5-MMP 
in the substantia propria. Previous histologic studies have revealed different patterns of 
inflammatory cell infiltration with an increased number of polymorphonuclear 
neutrophils (PMNs) during the early stage of inflammation and increased macrophages 
during the late stage. These results indicate a good correlation between the 
overexpression of the MT-MMPs in the infected corneas and the inflammatory response- 
that is, leukocyte infiltration-indicating that inflammatory cells such as macrophages and 
PMNs may play a role in the upregulation of MT-MMPs during corneal infection, which 
in turn can cause the destruction of corneal tissue. 
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Assessment of proliferative activity in colorectal carcinomas by 
quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). 

Duchrow M Hascmcvcr S , Broil R , Bruch Ul\ VVtndhovcI U . 

Surgical Research Laboratory, Surgical Clinic, Medical University of Lubeck, 
Ratzeburger Allee 160, D-23538 Lubeck, Germany. 

The monoclonal antibody Ki-67 and the isospecific monoclonal antibody MIB-1 are 
routinely used in oncology to assess the proliferation index of tumor cells. A more 
objective and sensitive method is the determination of the of Ki-67 protein-specific 
mRNA by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). In 25 
resected colorectal adenocarcinomas of different stages and grades we determined 
between 0.2 and 4.4 amol (I0(-18) mol) Ki-67 protein-specific mRNA per microgram 
total RNA (median = 0.88 amol). The corresponding Ki-67 indices (expressing the 
percentage of Ki-67/MIB-l positive tumor cells) ranged from 4 1 to 8 1 % (median = 6 1 %). 
We found a good correlation between Ki-67 index and mRNA expression (r = 0.75), a 
significant correlation between both data and tumor stage (primary tumor, regional nodes, 
metastasis [pTNM] staging classification) (p < 0.001), but not between both data and 
tumor grade. Both Ki-67 indices (p = 0.05) and mRNA levels (p = 0.014) correlated 
significantly to the patients' survival. These results demonstrate that the Ki-67 protein- 
specific quantitative RT-PCR is a useful method for the characterization of tumor cell 
proliferation. 
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Molecular characterisation of carbohydrate digestion and absorption in 
equine small intestine. 

Dyer J , Fcrnandcz-Castano Merediz E , Salmon KS . Proitdman CJ , Edwards GO , 
Shirazi-Becckcy SP . 

Department of Veterinary Preclinical Sciences, University of Liverpool, UKL. 

Dietary carbohydrates, when digested and absorbed in the small intestine of the horse, 
provide a substantial fraction of metabolisable energy. However, if levels in diets exceed 
the capacity of the equine small intestine to digest and absorb them, they reach the 
hindgut, cause alterations in microbial populations and the metabolite products and 
predispose the horse to gastrointestinal diseases. We set out to determine, at the 
molecular level, the mechanisms, properties and the site of expression of carbohydrate 
digestive and absorptive functions of the equine small intestinal brush-border membrane. 
We have demonstrated that the disaccharidases sucrase, lactase and maltase are expressed 
diversely along the length of the intestine and D-glucose is transported across the equine 
intestinal brush-border membrane by a high affinity, low capacity, Na-f7gtucose 
cotransporter type I isoform (SGLT1). The highest rate of transport is in duodenum > 
jejunum > ileum. We have cloned and sequenced the cDNA encoding equine SGLTl and 
alignment with SGLTl of other species indicates 85-89% homology at the nucleotide and 
84-87% identity at the amino acid levels. We have shown that there is a good correlation 
between levels of functional SGLTl protein and SGLTl rriRNA abundance along the 
length of the small intestine. This indicates that the major site of glucose absorption in 
horses maintained on conventional grass-based diets is in the proximal intestine, and the 
expression of equine intestinal SGLTl along the proximal to distal axis of the intestine is 
regulated at the level of mRNA abundance. The data presented in this paper are the first 
to provide information on the capacity of the equine intestine to digest and absorb soluble 
carbohydrates and has implications for a better feed management, pharmaceutical 
intervention and for dietary supplementation in horses following intestinal resection. 
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Suppressors of cytokine signaling proteins arc differentially expressed in 
TUl and Th2 cells: implications for Tli cell lineage commitment and 
maintenance. 

Egwuagu CE, YtiCR, Zhang M, Malidi RM , Kim SJ f^ryjr 

Laboratory of Immunology, National Eye Institute, National Institutes of Health 
Bethcsda, MD 20892, USA. emeka@lielix.nih.gov ' 

Positive regulatory factors induced by IL-12/STAT4 and IL-4/STAT6 signaling during T 
cell development contribute to polarized patterns of cytokine expression manifested by 
differentiated Th cells. These two critical and antagonistic signaling pathways are under 
negative feedback regulation by a multimember family of intracellular proteins called 
suppressor of cytokine signaling (SOCS). However, it is not known whether these 
negative regulatory factors also modulate Thimi2 lineage commitment and maintenance 
We show here that CD4(H-) naive T cells constitutively express low levels of SQCS I 
SOCS2, and SOCS3 mRNAs. These mRNAs and their proteins increase significantly in 
nonpolarized Th cells after activation by TCR signaling.We further show that 
differentiation into Thl orTh2 phenotype is accompanied by preferential expression of 
distinct SOCS mRNA transcripts and proteins. SOCS1 expression is 5-fold higher in Thl 
than in Th2 cells, whereas Th2 cells contain 23-fold higher levels of SOCS3 We also 
demonstrate that IL-12-induced STAT4 activation is inhibited inTh2 cells that express 
high levels of SOCS3 whereas IL-4/STAT6 signaling is constitutively activated in Th2 
cells, but not Thl cells, with high SOCS1 expression. These results suggest that mutually 
exclusive use of STAT4 and STAT6 signaling pathways by differentiated Th cells may 
derive inpart, from SOCS3- or SOCS I -mediated repression of IL-12/STAT4- or IL- 
4/STAT6 signaling in Th2 and Th I cells, respectively. Given the strong correlation 
between distinct patterns of SOCS expression and differentiation into the Th l or Th2 
phenotype, SOCS1 and SOCS3 proteins are therefore Th lineage markers that can serve 
as therapeutic targets for immune modulation therapy! 
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Intravitreal invading cells contribute to vitreal cytokine milieu in 
proliferative vitreoretinopathy. 

EI-Ghrably IA , Qua HS , Orr GM, Fischer D , Tighe PJ . 

Larry A Donoso Laboratory for Eye Research, Department of Ophthalmology, University 
of Nottingham, UK. 

AIM: To examine the contribution of infiltrating cells in the local production of cytokines 
within the vitreous of patients with proliferative vitreoretinopathy (PVR); METHODS: 
The presence of mRNA coding for IL'-6, IL-8, IL-lbeta, IL-1 alpha, TNFalpha, 
IFNgamma, IL-12, and HPRT was investigated in 25 vitreous samples from patients with 
PVR, 1 1 vitreous samples from patients with retinal detachment (RD) not complicated by 
PVR, and 10 vitreous samples from patients with macular hole (MH). A quantitative 
reverse transcriptase polymerase chain reaction (RT-PCR) using an internal competitor 
was used to investigate these samples. From these samples, 15 PVR, 8 RD, and 8 MH 
were analysed for the' protein levels of the same cytokines using enzyme linked 
immunosorbent assay (ELISA). Spearman correlation was used to test any association 
between mRNA and cytokine protein levels^as an indicator of the contribution these cells 
make to the intravitreal cytokine milieu. RESULTS: A strong correlation was found 
between mRNA and their respective cytokine levels ((protein products) for IL-6, IL-8, IL- 
lbeta, IL-1 alpha, TNFalpha, IFNgamma (Spearman r - 0:83, 0.73, 0.67, 0.91, 0.73, and 
0.73 respectively), but not for IL-12. The median levels of IL-6, IL-8, IL-lbeta, and 
IFNgamma mRNA and their respective cytokines were significantly higher (p <0.05) in 
patients with PVR than in those with macular' hole. There was no statistically significant 
difference in the median levels of IL-1 alpha- mRNA between PVR and MH but the 
cytokine IL-1 alpha was detected at a significantly higher level in PVR compared with 
MH patients. Between PVR and RD patients, there was no statistically significant 
difference in mRNA levels for all the investigated cytokines (p >0.05) except for IL-6 
where, there was a statistical significance (p= 0.038). In contrast, the median levels of IL- 
6, IL-8, and IL-lbeta cytokines were significantly higher (p <0.05) in patients with PVR 
than in those with RD, whereas for IL-1 alpha and IFNgamma no significant statistical 
difference was detected between PVR and RD patients (p >0.05). When results of RD 
and MH patients were compared, a statistical difference was only detected in mRNA 
levels of INFgamma (p -= 0.008). However, no difference was detected forlNFgamma 
(protein product) or for any of the other cytokines between RD and MH patients. 
CONCLUSION: Levels of both protein and mRNA encoding IL-6, IL-8, IL-lbeta, and 
IFNgamma is significantly increased in vitreous samples from patients with PVR. The 
strong correlation between ELISA detectable cytokines (protein products) and their 
respective mRNA levels suggest that intravitreal, invasive cells are the major source of 
these cytokines, with the exception of IL-12. Cells invading the vitreous do not appear to 
locally produce IL-12 mRNA. This would appear to implicate cells peripheral to the 
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vitreal mass as the major source of this cytokine. 
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Modulation of the glutainatergic receptors (AMPA and NMD A) and of glutamate 
vesicular transporter 2 in the rat facial nucleus after axotomy. 

Elcore L, Vassias I T Vidal PP . de Wacle C . 

LNRS (CNRS-Paris V), ESA 7060, Centre Universitaire des Saints-Peres, 45 rue des Saints- 
Peres, 75270 Paris Cedex 06, France. 

Facial nerve axotomy is a good model for studying neuronal plasticity and regeneration in the 
peripheral nervous system. We investigated in the rat the effect of axotomy on the different 
subunits of excitatory glutamatergic AMPA (GLuRl-4), NMD A (NRI, NR2A-D) receptors, post- 
synaptic density 95, vesicular glutamate transporter 2, beta catenin and cadherin. mRNA levels 
and/qr protein production were analyzed 1, 3, 8, 3Q and 60 days after facial nerve axotomy by in 
situ hybridization and intmunohisto fluorescence. mRNAs coding for the GLuR2-4, NRI, NR2A, 
B, D subunits of glutamatergic receptors and for post-synaptic density 95, were less abundant after 
axotomy. The decrease began as early as 1 or 3 days after axotomy; the mRNAs levels were 
lowest 8 days post-lesion, and returned to normal or near norma! 60 days after the lesion. The 
NR2C subunit mRNAs were not detected in either lesioned or intact facial nuclei. 
Immunohistochemistry using specific antibodies against GLuR2-3 subunits and against NRI 
confirmed this down-regulation. There was also a large decrease in vesicular glutamate transporter 
2 immunostaining in the axotomized facial nuclei at early stages following facial nerve section. In 
contrast, no decrease of NR2 A subunit and of post-synaptic density 95 could be detected at any 
time following the lesion, beta Catenin and cadherin immunoreactivity pattern changed around the 
cell body of facial motoneuron by day 3 after axotomy, and then, tends to recover at day post- 
lesion 60 days. Therefore, our results suggest a high correlation between restoration of 
nerve/muscle synaptic contact, synaptic structure and function in facial nuclei. To investigate the 
mechanisms involved in the change of expression of these proteins following axotomy, the facial 
nerve was perfused with tetrodotoxin for 8 days. The blockade of action potential significantly 
decreased GLuR2-3, NRI and NR2A mRNAs in the ipsilateral facial nuclei. Thus, axotomy- 
induced changes in mRNA abundance seemed to depend partly on disruption of activity. 
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Altered levels of scavenging euzyuies in emb.yos subjected to a diabetic 
environment. 

Forsberg H, Coc^XA, Cag!icro E, Eriksson U.f 

Department of Medical Cell Biology, University of Uppsala, Sweden. 

Maternal diabetes during pregnancy is associated with an increased rate of congenital 
maiformafons m the offspring. The exact molecular etiology of the dis JroeT 
embryogenes,s ,s unknown, but an involvement of radical oxygen species in the 
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m vrvo, or high glucose m vitro alters the expression of the free oxygen radical 
scavengmg enzymes superoxide dismutase (CuZnSOD and MnSOD), catalase and 
glutathione perox.dase m rat embryos during late organogenesis. We tudied oZring of 
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Oxytocin receptors ia boviue cervix: distribution and geue expression 
during the cstrous cycle. 

Euchs_AR, IveUR, Ficjds_PA, Chang SM . FiVjrt^JUjr 
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Oxytocin (OT) receptor (OTR) concentrations were determined in the cervix of 
nonpregnant cows on cycle Days 0, 3, 7-8, 17, and 19 (n = 3-4 cows each day)- [3HJOT 
was used as the labeled Hgand. Mucosal and muscle layers of the cervix were also 
analyzed separately for both ligand binding and expression of the OTR gene using a 
newly developed RNase protection assay (RAP). Cellular localization of OTR protein 
was deterrmned by .rranunohistochemistry. All regions of cervix from cows at estrus had 
high concentrations of OTR; in the luteal phase, all were sharply down-regulated At 
™? I™, had 3b0Ut 3 °- foId highef <»trations than the muscle layer. 

Zll^ . W3S ^ Y J?*?* by b thG mUCOSa from cows - white much 

weaker signals were found m the muscle. On Days 7-17, the OTR mRNA signals in both 

ST ^ 7? de / ere ^ faint or "ondetectable. Thus, there was a good correlation 
between hgand binding and mRNA expression, which suggest, that OTR concentrations 

oHI^ I'" 811 / f™?"^™ 1 ,evei - The epithelial cells at the luminal surface 
of the mucosa were thepnnc.pal site of immuno reactive OTR; muscle cells showed 

pfoS 7 ^ ? gn3lS - Previouslv > OT was ^ to stimulate prostaglandin (PG) 
E2 output rn vuro m bovme cervical tissues. Since PGE2 is capable of softening the 

S^S*? SUggeSt that OT mav have a novel physiological function to cause 
.softening of the bovine cervix mediated by the release of PGE2. 
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A sampling of the yeast protcome. 

Futchcr B , Latter GL Monardo P , McLaughlin CS , Garrets Jt . 

Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 1 1724, USA. 
futcher@cshl.org 

In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered quantitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance and, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were identified. The behavior of proteins in differential centrifugation experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast prqteome. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis and gathered quan- 
titative information from about 1,400 spots. We found that there is an enormous range of protein abundance 
and, for identified spots, a good correlation between protein abundance, roRNA abundance, and codon bias. 
For each molecule of well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and etbanol media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phosphoproteins were identified. The behavior of proteins in 
differential centrifugalion experiments was examined. Such experiments with 2D gels can give a global view of 
the yeast proteome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The nexl logical level 
of analysis is that of the expressed set of proteins. We have 
begun to analyze the yeast proteome bv using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 7, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
cerevisiae is a good organism for this approach since it has a 
weJJ-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
3nd the relative lack of in irons in S. cerevisiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are. three pillars on which such a database rests: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (iii) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for 5. cerevisiae have been described else- 
where (7, 8). Here we describe quantitative data for these 
proteins under a variety of expeiimcntal conditions. 

MATERJALS AND METHODS 
Strains and inriJia. 5. fereviuae W3D3 [MATn ode2-l his3 I? f J5 leu 2- 3, ))2 
irph} ura3-l can 1-100) «as used (26). - Mt l YND (yeast nitrogen base) medium 
was 1.7 g of YNB (Dilco) pet liici. 5 g of ammonium sulfate per liter, and 
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adenine, uracil, and all amino acids except methionine; -Met -Cys YNB me- 
dium was the same but without methionine or cysteine. Medium was supple- 
mented wilh 27c glucose (for most experiments) or with 2% ethanol (for ethano) 
experiments). Low-phosphate YEPD was described by Warner (23). 

Jsolopic labeling of yeast and preparation of cell extracts. Yeast strains were 
labeled and proteins were extracted as described by Garrels el al. (7, 8). Briefly, 
cells were grown to 5 X 10* cells per ml. al 30*C; J ml of culture was transferred 
to o fresh tube, and 03 mCi of ( ?5 SJ methionine (e.g., Express protein labeling 
mix; New England Nuclear) was added to this l-ml culture. The cells were 
incubated for a further 10 to J 5 min and then transferred to a J-5-mJ microcen- 
trifuge tube, chilled on ice, and harvested by cent rifuga lion. The supernatant was 
removed, and the cell pellet was resnspended in 100 pi of lysis buffer (20 mM 
Tris-HCI [pH 7.6), 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA. 
0.1% deortycholate: just before use, pbenylmelhylsulfonyl fluoride was added to 
I mM, leupeptin was added to I ng/ml, pepstatin was added to 1 ng/mL losyl- 
sulfonyt pherrylalanyl chloiomethyl ketone was added to 10 jig/ml, and soybean 
trypsin inhibitor was added to 10 u-g/ml). 

The resuspended cells were transferred to a screw-cap J.5-ml polypropylene 
tube containing 0.23 g of glass beads (0.5-rom diameter; Biospec Products) or 
0.40 g ofzirconia beads (0J>-mm diameter; Biospec Products). After the cap was 
secured, the lube was inserted into a MiniBeadbealer 8 (Biospec Products) and 
shaken at medium high speed at 4"C for I min. Breakage was typically 75%. 
Tubes were then spun in a microcentrifuge for JO s at 5,000 X g at 4°C. 

With a very fine pipette tip, liquid was withdrawn from Ihe beads and trans- 
ferred to a prechilled JJ-rm* tube containing 7 uJ of DNase 1 (0^ mg/ml; Cooper 
product no. 6330)-RNase A (0.25 mg/ml; Cooper product no. 5679)-Mg (50 mM 
MgCI 2 ) mix. Typically 70 of liquid was recovered. The mixture was incubated 
on ice for JO min to allow the RNase and DNase to work. 

Nest, 75 p.1 of 2 X dSDS (2X dSDS is 0.6% sodium dodecyl sulfate JSDSJ, 2% 
meicaptoethanol, and 0. 1 M Tris-HCI |pH 8}) was added. The lube was plunged 
into boiling water, incubated for 1 min, and ihen plunged into ice. After cooling, 
the tube was centrifuge*) at 4°C for 3 min at 14 f 000 x g. The supernatant was 
transferred to a fiesh tube and frmen at -70°C About 5 pi of this supernatant 
was used for each 2D gel. 

2D piilyarrylamidr pi Is. 2D gels were made and run as described elsewhere 
(6-8). 

Imagr analysis of tnr gels. The Quest H software system was used for quan- 
titative image analysis (20, 22). Two techniques were used to collect quantitative 
dara for analysis by Quest J I sofrware. Fits!, before the advent of phospbOT im- 
agers, gels were dried and ftuorogiaphed. Each gel was exposed to film for three 
di tie rent times (typically I day, 2 weeks, and 6 weeks) to increase the dynamic 
range of the data. The films were scanned along wilh calibration strips to relate ■ 
him optical density to disintegrations per minute in the gels and analyzed by the 
soJlware lo obtain a lineai relationship between disintegrations per minute in the 
spots and optical densities of the film images. The quantitative data are ex- 
pressed as ports per million of the total cellular protein. Tins value is calculated 
from Ihe disintegrations per minute of the sample loaded onto the gel and by 
compaiing the film density of carb data spot wilh density of the film over the 
calibration strips of known radioactivity ex posed to the same film. This yields the 
disintegrations pei minute per millimeter fui each spoi on the gel and ihence its 
par is- pet- minute value. 

After the advent i-l phosphor imaging, gels bearing "S- labeled pioteins weie 
exposed lo phosphoi image i screens and scanned by a Fuji phosphor imager, 
rypically lot two e.v,„,vmes per gel. Calibration snips of known radioactivity were 
oposcd simultaneously. Scan data tiom the phosphniimagei was assimilated by 
Ones! II .software, and quantitative data wrie iccordcd foi the spots on I he gels. 
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Measurements of protein turnover. Cells in exponential phase weie pulse- 
labeled with | 35 S)rnethionme, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 min (in one 
experiment) or up to 160 mm (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sample (JO min). Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., the different 
samples had different amounts of protein but equal amounts of ^S). Spots were 
quant i tat ed with a phosphor imaging and Quest software. 

The software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
line through the data points, and looked for spots where this line had a statis- 
tically significant negative slope. Jn one of the experiments, there was one such 
spot. To the eye, this was a minor, unidentified spot seen only in the first two 
samples (10 and 20 min). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, we concluded that none of the identified 
spots (and all but one of the visible spots) represented proteins with long 
half-lives. 

Centrifugal fractionation. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above except that no detergent (i.e., no 
deoxycholate) was present in (be lysis buffer. The crude rysate was cleared of 
. unbroken cells and large debris by centrifugal ion at 300 X g for 30 s. The 
supernatant of this centrifugal ion was then spun at 16,000 X g.fos 10 min to give 
the pellet used for F»g. 6B. The supernatant of the 16,000 X g, 10- min spin was 
then spun at J 00,000 X g for 30 min to gfvc the supernatant used for Fig. 6A. 

Protein abundance calculations. A haploid yeast cell contains about 4 X~10~ 12 
g of protein (1, 15). Assuming a mean protein mass of 50 kDa, there are about 
50 X 30* molecules of protein per cell. There are about L8 methionines per 10 
kDa of protein mass, which implies 45 X 10 8 molecules of methionine per cell 
(neglecting ihe small pool of tree Met). We measured (i) the counts per minute 
in each spot on the 2D gels, (ii) the iota! number of counts on each gel (by 
integrating counts over the entire gel), and (Hi) the total number of counts 
loaded on the eel (by scintillation counting of the original sample). Thus, we 
know what fraction of the total incorporated radioactivity is present m each spot. 
After correcting for ihe methionine (and cysteine [sec below J) content of each 
protein, we calculated an absolute number of protein molecules based on ihe 
fraction of radioactivity in each spot and on 50 X 10* total molecules per cell. 

The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, the number of cysteine molecules 
per protein was also taken into account in calculating the number of molecules 
of protein, but Cys molecules were weighted one-fifth as heavily as Met mole- 
cules. 

mRNA abundance calculations. For estimation of roRNA abundance, we used 
SAGE (serial analysis of gene expression) data (27) and Alfymetiix chip hybrid 
ization data (29a. 30). The mRNA column in Table 1 shows mRNA abundance 
calculated fiom SAGE data alone. However, the SAGE data came from cells 
growing in YEPD medium, whereas our protein measurements were from cells 
growing in YNB medium. In addition, SAGE data for low- abundance mRNAs 
suffers from statistical variation. Therefore, we also used chip hybridi2ation data 
(29a, 30) foi mRNA from cells grown in YNB. These hybridization data also had 
disadvantages. First, the amounts of high-abundance mR K/\s were systemati- 
cally underesiimated, probably because of saturation in ihe hybridizations, which 
used 10 Mg tif cRNA. For example, (he abundance of ADJ1) mRNA was 197 
copies per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of ENG2 mRNA was 248 copies per cell by SAGE but only 41 by 
hybridization. When the amount of cRNA used in the hybridization was reduced 
"»° 1 ug, the apparent amounts of mRNA were similar to the amounts determined 
by SAGE (29a, 29b). However, experiments using 1 ug of cRNA have been done 
for only some genes (29a). Because amounts of mRNA were normalized to 
15 : 000 per cell, and because the amounts of abundant mRNAs were underesti- 
mated, tbeie is a ?.2-fold overestimate of the abundance of nonabundani 
mRNAs. W c calculated this factor of 22 by adding together the number of 
mRNA molecules Irom a large number of genes expressed at a low level toi both 
SAGE daia and hybridization data. The sum for the same genes from hybrid- 
Ration data is 2.2-fold greater than that from SAGE data. 

To take into account these difficulties, we compiled a list ol "adjusted' 1 mRNA 
abundance as follows. For all high- abundance mRNAs of our identified proteins, 
we used SAGE data. For all of these particular mRNAs, chip hybridization 
suggeMed that mRNA abundance was the same in YEPD and YNB media. For 
medium- abundance rnRNAs, SAGE data were used, but when hybridization 
data showed a significant difference between YEPD and YNB, then the SAGE 
data were adjusted by the appropriate factor. Finally, foi low- abundance 
mRNAs, we used data fiom chip hybridizations fiom YNB medium but divided . 
by 2.2 to noimalhe to the SAGE results. These calculations were completed 
without reference to protein abundance. 

CAJ. The codon adaptation inde* (CAl) was taken from the yeast proicome 
database | YPD) (13), lor which calculations were m:itlr according to Sharp jnd 
Li (24). Briefly, ihe indo uses a reference set ol highly expressed t>eneMO as.rign 
a value ro e:*ch codon. and then a score tor a gene is calculated from the 
frequency ot use of ihe various codons in that gene (24 J. 

Statistic! analysis. The J MP program was used *\\h the aid of T. Tullv. The 
JMP pjoi-ram :how ; d ilia i ncithei mRNA not pmicin abundances were nor- 
mally itisiribuled: titer i- lore, Spearman rank correlation t-oi-lfrocnrs (/,) wen- 



calculated. The mRNA (adjusted and unadjusted) and protem. data were also 
transformed so that Pearson product-moment correlation coefficienls (r p ) could 
be calculated. First, this was done by a Box-Cox transformation of log-trans- 
formed data. This transformation produced normal distributions, and an r p of 
0.76 was achieved. However, because the Box- Cox transformation is complex, we 
also did a simpler logarithmic transformation. This produced a normal distribu- 
tion foi the protein data. However, the distribution for the mRNA and adjusted 
mRNA data was close to, but not quite, normal. Nevertheless, we calculated the 
r p and found that it was 0.76, identical to the coefficient from (be Box-Cox 
transformed data. We therefore believe thai this correlation coefficient is not 
misleading, despite Ihe fact that the log(mRNA) distribution is not quite normal. 



RESULTS 

Visualization of 1,400 spots on three gel systems. Yeasl 
proteins have isoelectric points ranging from 3.1 to 12.8, and 
masses ranging from Jess than 10 kDa to 470 kDa. It is difficult 
to examine all proteins on a single kind of gel, because a gel 
with the needed range in pi and mass would give poor resolu- 
tion of the thousands of spots in the central region of the gel. 
Therefore, we have used three gel systems: (i) pH "4 to 8" with 
10% polyacrylamide; (ii) pH "3 to 10" with 10% poryacryl- 
amide; and (iii) nonequilibrium with 15% polyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-8, 10% polyacrylamide gel. The pH 
at the basic end of ihe isoelectric focusing gel cannot be main- 
tained throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pi range somewhat 
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acrylamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarter to one-third of the pro- 
teins expressed under these condi lions (27, 30). Here, we focus 
on ihe proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins ate present on ihese 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrale or nearly comigrate. When two proteins are 
resolved, but are close together; and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when the weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus, weak spots 
can be seen only when they are well separated from strong 
spots. 

For a given gel, the number of cleieciable spots initially rises 
with exposure time. However, beyond an optimal exposure, the 
number of distinguishable spois begins to decrease, because 
signals from strong spots cover signals from nearby weak spots. 
At long exposures, the whole autoradiogiam turns black. Thus, 
there is an optimum exposure yielding ihe maximum number 
of spots, and at this exposure ihe weakest spots are not seen. 

Largely because of the problem of coverage, the proteins 
seen are strongly biased toward abundant proteins. All identi- 
fied proteins have a CAl of 0.18 or more, and we have iden- 
tified no Iransmption factors or ptotein kinases, which are 
nonabundani proteins. Thus, this technology is useful for ex- 
amining protein synthesis, amino acid metabolism, and glyco- 
lysis but not for examining tr:mscription ; DNA replication, or 
the cell cycle. 



Best Available Copy 




16kDa 



EFBI 



YKLM7w 
YKL056C 




B 

115 kDa 



,YEF3 



>UBAI 
.HSPICg} CDC 48 

Wi. . *rk AR2 




;EN02 : 



7DH3 



ADH2 ., _ 
5- H0M2 • >SAl ; |<V .. 



1DH2 



ADEl MOU 



GPPt:: 



,RPAO 
►*^BMH2 
jsi — BMHt 



/GSP Ladki 



.YkL066c 



G.7 



PH 



4.3 



PrMfimri-'o? ^ 'iT 0 ?' 3 ' ^ ,, ' f . i ^ te " ,ic l,Kusin * dimc »«o». s«»eich« l.om pi I 6.7 (Id.) to pH 4.3 (ugh,). The venical wis is the poh-acrylamiJe 

a L ^ u , >m al r: V ^V^ 10 ^ IO ,3U kDa {K ^>- Fw !» Ml A * "»*■■ ^ "»> "> phase in g h,co£io, £»d D 

yiown in clhynol. ipols lahi'lrJ I llnoin'h 6 ;»t unijf nti/it'd piolcini hiyltlv injured in eih;mol. 



7359 



7360 FUTCHER FT AL. 



Mol CelL Bjol 



Spot identification. The identification of various spots has 
been described elsewhere (7, 8). At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-cbase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein hajf-lives (Materi- 
als and Methods). Cells were labeled with [ 35 S]methionine for 
10 min. and then an excess of unlabeled methionine was added. 
Samples were taken at 0, 10, 20, 30, 60. and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were run, and spots were quant it a ted. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
translational modifications (e.g., phosphorylation of RpaO and 
Efbl). Thus, the proteins being visualized are all or nearly all 
very stable proteins, with half-Jives of more than 90 min. Gygi 
et al. (10) have come to a similar conclusion by using the N-end 
rule to predict protein half-lives. This result does not imply 
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same half- life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine^ and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 1 and 2; Materials and 
Methods). About 1,000 unidentified proteins were also quan- 
tified, assuming an average content of Met and.Cys. 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that wc are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had 8,904 ppm, the 10th 
most abundant had 2,842 ppm, the 100th most abundant had 
314 ppm, the 50Qth most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum 
exposure) had 23 ppm. Thus, there is more than a 300-fold 
range in abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pll 4-8 gel. the most abundant 60 proteins account 
for 50%, and the most abundant 500 proteins account for 80%. 
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins; wc estimate that half of the total 
cellular protein is accounted lor by fewer than 300 different' 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein abundance with mRNA abnndanre. 
Estimates of mRNA abundance for each gene have been made 
by SAGE (27) and by hybridization of cRNA lo oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table 1 lists the number of molecules of mRNA 
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); n second incorpo- 
rates data from boll* SAGE and hybridization (30) (adjusted 
mRNA) (Table 1; Materials and Methods). Wc correlated 
protein abundance with mRNA abundance (Fig. 2). For ad- 



justed mRNA versus protein, the Spearman rank correlation 
coefficient, r 5> was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r p> on log transformed data (Materials and 
Methods) was 0.76 (P < 0.00001). We obtained similar corre- 
lations for mRNA versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation, is far from perfect; for mRNAs of a given abundance, 
there is at least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due lo post transcriptional 
regulation, and some is due lo errors in the mRNA or protein 
data. For example, the protein Yefi rims poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the low protein/mRNA 
ratio of Yei3. It is the most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 limes. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of ceil protein is synthesized in about 120 
mm under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every, 2 s. This is a remarkably. high rate; it 
implies that if an average mRNA bears 10 ribosomes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average protein has 450 residues; this in turn implies 
translation of over 20 amino acids per s, a rate considerably 
higher than estimated for mammalians (3 to.S amino acids per 
s) (38). These estimates depend on the amount of mRNA per 
cell (13, 27); 

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
controlled for Jess abundant proteins. Many nonabundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptimal rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there are over 600 yeast 
genes which probably have short open reading frames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundance. The 
mRNAs for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic — highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias with abun- 
dance continues for medium- abundance proteins. There are 
various mathematical expressions quantifying codon bias; here, 
we hnvc used the CA1 (21) (Male rials and Methods) because 
it gives a result between 0 and I. The r s for CA1 versus protein 
abundance is 0.80 (P < 0.0003). similar to the rnRNA-piotein 
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" CAI. a measure of codoii bias, is taken from the YPD mRNA. number of mRNA molecules pei cell liom SAGE data (27); adjusted mRNA, number of rnRNA 
YNil t p . i? bo, . h ' SACE :md chi P hybiidwaiion (30) (see Materials and Method*); Pioiein (Gtu), numbei nl molecules of p,oicin p« cel( in 



YNB-kIucosc; Protcm (Eth), number ot molecules ol protein per cell in YNB-eihanol; E/G ratio, ratio of prote.n abundance in e.hanol lo Wucose The E/G ratio is 
not men .1 « ^ close ,o ] or rf It was not ratable (NK) in multiple yds. Some gene products (e.g., TiM *nj Tif? | Tin,?}) Wf ,e diftkuh .0 dis.ingmsh on either 
a pi.rtcn 01 an mRNA basis; these a.e pooled. NoMu, ibere was no suitable A'//iJU site in the _V ,epion of the gene, and so there a,e no SAGE mRNA data- No Mel 
ihf mature gene pjoduci contains no methionines, and so there ...... • 
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TABLE 2. Functions of proteins listed in TabJe 1 



Name* YPD liltc lines* 



Adhl Alcohol dehydrogenase I; cytopJasmic isozyme reducing acetaldehyde lo-ethanol, regenerating NAD + 

Adb2 Alcohol dehydrogenase II; oxidizes elhanot to acetaldehyde, glucose repressed 

Cit2 Citrate synthase, peroxisomal (nbnmilochondrial); converts acety^CoA and oxaloacetate to citrate plus CoA 

Enol Enolase J (2-pbosphogrycerate dehydratase); converts 2-phospho-D-gJycerate to phosphoenolpyruvate in glycolysis 

Eno2 Enolase 2 (2-phosphogrycerale dehydratase); converts 2-phosphcHO-gJycerate to phosphoenolpyruvate in glycolysis 

FbaJ Fructose bisphosphate aldolase II;- sixth step in glycolysis 

Hxkl ' Hexokinase I; converts bexoses to hexose phosphates in glycolysis; repressed by glucose 

Hxk2 Hexokinase II; converts bexoses to hexose phosphates in glycolysis and plays a regulatory role in glucose repression 

Jell hpcilrale lyase, peroxisomal; carries out part of the gryoxylate cycle; jequired for gluconeogenesis 

Pdbl Pyruvate dehydrogenase complex, El beta subunil 

PdcJ Pyruvate decarboxylase isozyme 1 

Pfkl Phiwpbofructokinase alpha subunil, part of a complex with Pflc2p which carries out a key regulatory step in glycolysis 

Pgil Glucose -6-phosphate isomer ase, converts glucose-6- phosphate to fructose- 6-phospbate 

Pyd Pyruvate carboxylase I; converts pyruvate to oxaloacetate for gluconeogenesis 

Tall Transa Idol ase; component of nonoxidative part of pentose phosphate pathway 

Tdh2 Glycer a Id ehyde-3-phosphate dehydrogenase 2; converts D-gryceraldehyde 3-pbospha I e to J ,3-dephosphogryceralc 

Tdh3 Glyceraldehyde-3-phosphate dehydrogenase 3; converts r>grycer aldehyde 3-phosphate lo 1,3-dephosphogrycerate 

Tpil Triose phosphate isom erase; inter converts grycer aide hyde-3- phosphate and dihydroxyacetone phosphate 

Efbl Translation elongation factor EF-1B; GDP/GTP exchange factor for Tenp/Tef2p 

Efll Translation elongation factor EF-2; contains diphtbamide which is not essential for activity; identical to Efl2p 

Eft2 Translation elongation factor EF-2; contains diphtbamide which is not essential for activity; identical to Eftlp 

FrtJ Translation initiation factor eIF3 beta subunit (p90); has an RNA recognition domain 

RpaO (RPPO) Acidic ribosoroal protein AO 

Tifl Translation initiation factor 4 A (e!F4A) of the DEAD box family 

Ttf2 Translation initiation factor 4 A (eIF4A) of the DEAD box family 

Yef3 Translation elongation factor EF-3A; member of ATP-binding cassette super family 

Chaperonin homologous to E. coti HtpG and mammalian HSP^O 
Mitochondrial chaperonin that cooperates with HspJOp; homolog of E. coti GroEL 
Heat- inducible chaperonin homologous lo E coli HtpG and mammalian HSP90 

Heat shock protein required for induced Ihermotoleiancc and for rcsolubilizing aggregates of denatured proteins; important for [psi - ]- 
lo-[PSI + ] prion conversion 

Heat shock protein of the endoplasmic reticulum lumen required for protein translocation across the endoplasmic reticulum membrane 

and for nuclear fusion: member of the HSP70 family 
Cytoplasmic chaperone; heat shock protein of the HSP70 family 
Cytoplasmic chaperone; member of the HSP70 family 
Heat shock protein of HSP70 family involved in the translations! apparatus 
Heal shock protein of HSP70 family, cytoplasmic 

Mitochondrial protein that acts as an import motor with Tim44p and plays a chaperonin role in receiving and folding of protein chains 

during import; heat shock protein of HSP70 family 
Heal shock protein of the HSP70 family; multicopy suppressor of mutants with hyperaclivated Ras/cyclic AMP pathway 
Stress-induced proiein required for optimal growth at high and low temperature;'has letrnlricopeplide repeats 



Adel Phosphoribosylamidoimidazole-succmocarboxamide synthase: catalyzes the seventh step in de novo purine biosynthesis pathway 

Ade3 C, lelrahydrofolate synthase (t Afunctional enzyme), cytoplasmic 

Ade5,7 Ph ospb or ibosyla mine -glycine ligase plus phosphor ibosylformylglycinamidine eye lo- ligase; bifunction.it protein 

Ara4 Argjnjnosuccinale lyase; catalyzes the final step in arginine biosynthesis 

Gdh) Gtutamatc dehydrogenase (NADP + ); combines ammonia and a-ketoglutnrale to form glutamale 

Gin J Glut amine synthetase; combines ammonia lo glutamnte in ATP- driven je action 

His4 Phosphor ibosvl- AMP cyclohydrolase/phosphorrbosyl-ATP ppophosphohvdrolase/histidinol dehydrogenase; 2nd, 3rd, and 10th steps of 

his biosynthesis pathway " 

,lv 5 Ketol-acid* rcducioisornvruse (acclohydroxy, acid red ueioisome rase) (alpha-keio-p-hydroxylacyl) reductoisomerasc); second step in Val 

and Ilv biosynthesis pathway 

'-ys9 Saccharopine dehydrogenase ("NADP"*, K-glulamale forming) (saccharopihe reductase), seventh step in lysine biosvnthesis pathway 

*4cl6 Homocysteine methyllransferase; (5-mclhYilelrabydroplerovI Iriglutamate homocysteine mcthvllransfcrase), methionine synthase, 
cobalamin independent 

Fro2 y- Glutamyl phosphate reductase (phosphoglulamale dehydrogenase), proline biosynthelic enzyme 

Scrl Phosphoscrine transaminase; involved in synthesis of serine fiom 3-phospboglycerale 

Trp5 Tryptophan synthase, last (5 1 h) step in tryptophan biosyn thesis pathway 

Actl Actin; involved in cell polarization, endocytosis, and other evloskelela! functions 

Adfc) Adenylate kinase (GTP.AMP phosphotransferase), cytoplasmic 

Ald6 Cytosolic acetaldehyde dehydrogenase 

Atp2 Beta subunil of Fl -ATP synthase; 3 copies are found in each Fl oligomer 

Bmh] HomoJog of mammalian 14-3-3 piolein; has strong similarity to Bmh2p 

Bmh2 Homolog of mammalian 14-3-3 protein; has strong similarity to BmhJp 

Cdc48 Protein of the AAA family of ATPases: required for cell division and homotvpic membrane fusion 

Cdc60 Leucyl-iRNA synthetase, cytoplasmic 

Erg20 Famesyl pyrophosphate synthetase; may be i ale- limiting step in stciol hi osyn thesis pathway . 
Gppl (Rhr2) - dl Glycerol phosphate phosphatase 

G spl Ran, a GTP-binding piotcin ol the R:is super family involved in trafficking through nuclear pores 

Ippl Inorganic pyrophosphatase, cytoplasmic 

LcbJ Component of serine C -palmitoyltransferase; first step in biosynthesis of long-chain ba^e component of sphineolipids 

Moll (Th>4) Thiamrne-iepressed protein essential for growth in the absence of thiamine 

Pabl PnJv{A)-bindimj! protein of cytoplasm and nucleus; pail of the 3'-end RNA- processing complex (cleava»e factor IV has 4 RNA 

recognition domains ° 

Fsal Mannose-J- phosphate guanyltransferasc; GDP- man nose pyrophosphorylase 

Rnj4 Ribonucleotide reductase small subunit 

Saml 5-AdenosyImethionine synthetase 1 

Sarn2 5-Adenosylmeihionine synthetase 2 
Sodl . Copper-zinc superoxide dismuiase 

Ubal Ubiquitrn- activating (EJ) enzvme 

YKL056 Resembles translaiionalrv controlled tumor protein of animal cells and higher plants 

YERl09(Ah P n Alkyl hydroperoxide leductase 

YMRJJ6 (AscJ) Abundant protein with effects on liansblional efficiency and cell size; has two WD (WD-40) repeals 



Accepted name from ihe Stifchotomytei genome datat>ase ami YPD. Names "rn parentheses represent recent changes. 
Com toy of Proi conic. Inc . reprinted with per minion. 
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FIG. 2. Correlation of protein abundance with adjusted mRNA abundance 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cell of the cognate mRNA, with an r of 076 Note the 
Joganthmic axes. Data for mRNA were taken from referees 27 and 30 and 
combined as described in Materials and Methods. 



correlation, confirming a strong correlation between CAI and 
protem abundance (Fig. 3). The relationship between CAI and 
protein abundance is Jog Jinear from ab6m 1,000,000 to about 
30,000 molecules per cell. We have no data for rarer proteins 
It is not dear whether CAI reflects, maximum or average 
levels of protem expression. The proteins used for the CA1- 
protein. correlation included some proteins which were not 
expressed at maximum levels under the condition of the ex- 
periment (Hsc82, HspHM, Ssal, Adel, Arg4, His4, and others) 
When these proteins were removed from consideration and 
the correlation between CAI and the remaining (presumably 
constitute) . protein* was recalculated, the r was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (protein 
molecules/cell) = (2.3 X CAJ) + 3.7. Thus, under certain 
conditions (a CAI of 0.3 or greater; a constitutively expressed 
gene), a very rough estimate of protein abundance can be 
made by raising 10 to the power of [(23 X CAJ) + 3 7] 

The distribution of CAJ over the genome (Fig. 4) consists of 
a lower, bell-shaped distribution, possibly indicating a region 
where there is.no selection for codon bias, and an upper flat 
distribution, starling at a CAJ of about 0.3. possibly indicating 
a region where there is selection for codon bias. Almost oil of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. Jn the lower, bell-shaped 
region, we do not know whether there is a correlation between 
CAJ and protein abundance. 

Changes in protein abundance in glucose and ethanol A 
comparison of cells grown in glucose (Fig. 1 A) with cells «rown 
in ethanol (Fig. IB) is shown in Table J. As is well known 
some proteins nie induced tremendously during growth on 
ethanol. Two sinking examples are the peroxisomal en7vmes 
JclJ (isoc.trate lyase) and C.i2 (citrate synthase), which are 
induced ,n ethanol by more than 300- and 12-fold, respectively 
(Hg- J; Table 1). These enzymes are key components of the 
glyoxy ate shunt, which diverts some acetyl coenzyme A 
(aceryJ-CoA) from the tricarboxylic acid cycle to gl»coneo°en- 
esis. X cerevtuac requires large amounts of carbohydrate for its 
cell wall; ,„ ethanol medium, this carbohydrate comes from 
gluconeogenics, which depends on the glyoxylate shunt and 
on. the glycolytic pathway running in reverse. The need for 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylate shunt enzymes 
in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the cell wall, is a major 
metabolic activity under these conditions. 

During gluconeogenesis, substrate-product relationships are . 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse 
reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly affected. Adhl and Adh2 each interconvert 
acetaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (17 mM), while Adh2 has a lower K m (0.8 mM)(5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acetaldehyde) (5, 29). Similarly, Enol (enolase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by 
2-phospbogrycerate (the glycolytic substrate), while Eno2 is 
inhibited by phosphoenolpyruvate (the gluconeogenic sub- 
strate). (4). Perhaps Enol has a lower K m for phosphoenol- 
pyruvate than does Eno2, though to our knowledge this has not 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol). 

Many heat shock proteins (e.g., Hsp60, Hsp82, Hspl04, and 
Kar2) were about twofold more abundant in ethanol medium 
than in glucose medium. This is consistent with the increased 
heat resistance of cells grown in ethanol (3). 

Enzymes involved in protein synthesis (EftJ, RpaO, and Tifl) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose: 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproleins. About 300 distinct spots, probably represent- 
ing 350 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B). 
We then aligned auloradiograms of three gels, each with a 
different kind of labeled protein ( 32 P only (Fig. 5B), 32 P plus 
S (Fig. 5A], and 35 S only [not shown, but see Fig. 1 for . 
example]). In this way, we made provisional identification of 
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FIG. 4. Distribution of CA1 over the whole genome, shown in intervals of 0.030 (i.e., there are 150 genes with a CA1 between 0.000 and 0.030, inclusive; 31 genes 
with a CA1 between 0.031 and 0.060; 269 genes with a CAJ between 0.061 and 0.090; 1,296 genes with a CA1 between 0.091 and 0.120; etc.). The distribution peaks 
with 2,028 genes with a CAI between 0.121 and 0J50. 



some of the 32 P-labeJed spots as particular 35 S-Jabeled spots. 
AJI such identifications pre somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigrate. Nevertheless,, we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P- labeled proteins are the hexokinases PJxkl and 
Hxk2. the acidic ribosome-associaled protein RpaO, the trans- 
lation factors YeD and Efbl, and probably Hsp70 heat shock 
proteins of the Ssa and Ssb families. RpaO and Efbl are quan- 
titatively monophosphorylaled. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has five or more spots, at least four of which 
comigrate with 32 P. Tpil has a major spot showing no ?2 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. Tifl has at least seven spots (7); two of these 
overlap with some -^P label, but five do not (Fig. 5). Eftl has 
at least three spots (7), and none of these overlap with 32 P, 
although there are three nearby, unidentified 3? P-labeled spots 
(a, c, and d in Fig. 5). Spots that seem to be extra forms of 
Met 6, Pdc], Eno2, and Fbal can be seen in Fig. 6A, but there 
is little 32 P at these positions in Fig. 5. Thus, phosphorylation 
explains some but not all of the different protein isofortns seen. 

The cell cycle is regulated in part by phosphorylation. We 
compared 3? P-labeled proteins from cells synchronized in G, 
with a -factor, in cells synchronized in G, by depletion of G, 
cyclins, and in cells synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were -difficult 
to reproduce. The cell cycle proteins regulated by phosphory- 
lation may nor be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 35 S-tobeled ex- 
tracts by centrifngntion (Materials and Methods). Figure 6A 
shows the proteins in the supernatant of a high-speed 
(100.000 X g, 30 min) centrifugal ion, while Fig. 613 shows the 
proteins in the pellet of a low-speed (16.000 X g t JO min) 
centrifugation. Many proteins are tremendously enriched in 
one h act ion or the other, while oihcis aie present in both. 



Most glycolytic enzymes (e.g^Tdh2, Tdh3, Eno2, Pdcl, Adhl, 
and Fbal) are enriched in the supernatant fraction. The only 
exception is Pfkl (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Eftl, Yef3, Prtl, Tifl, and RpaO) are in the pellet, 
possibly because of the association of ribosomes with the en- 
doplasmic reticulum. However, Efbl is in the supernatant, as is 
a substantial portion of the Efil. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 [not shown] and Uv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondrial proteins. The nuclear protein Gspl is in 
the pellet fraction. The enrichment produced by centrifugation 
makes it possible to see minor spots which are otherwise poorly 
resolved from surrounding proteins. Figure 6B shows that the 
previously identified Tif] spot is surrounded by as many as six 
other spots that cofractionate. We observed six identical ot 
very similar additional spots when we overexpressed Tifl from 
a high-copy-number plasmid (not shown). Signal overlaps only 
one or two of these spots in 1? P-labeIing experiments (Fig. 5), 
and so the different forms are not mainly due to different 
phosphorylation states. 

DJSCUSSJON 

Our experience with developing a 2D gel protein database 
for S. cerevisiae is summarized here. With current technology, 
we can see the most abundant 1,200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods thai we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified 148, with a 
bias toward the most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene overexpression will be particu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 
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FJG. 5. Phosphoryhrlcd pinicins. (A) Mixture ol ^"P-labelcd proteins and ? \S-labelcd proteins. Two separate labeling reactions were done, one with j2 P and one 
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2D gels of the kind that we have used are not suitable for 
visualization of rare proteins. However it will be possible to 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, glu- 
coneogenesis, amino acid synthesis, cell wall synthesis, nucle- 
otide synthesis, lipid metabolism, and the beat shock response. 

Gygi et al. (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
rnRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi ef al. feel that rnRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or rnRNA 
levels" (10). These different conclusions are partly a matter of 
viewpoint. Gygi et al. focus on the fact that the correlations of 
rnRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
range of rnRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to 
different methods of statistical analysis and 1Q real differences 
in data. With respect to statistics, Gygi et al. used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
covariance of rnRNA and protein abundance. Depending on 
the subset of data included, their r p values ranged from 0.1 to 
0.94. Because of the low r p values with some subsets of the 
data, Gygi et al. concluded that the correlation of rnRNA to 
protein was poor. However, (he r p correlation is a parametric 
statistic and so requires variates, following a bivariate normal 
distribution; that is, it would be valid only if both rnRNA and 
protein abundances were normally distribuled. In fact, both 
distribuiions are very far from normal (data not shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion that codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r 5 ). Since this 
statistic is nonparametric, there is no requirement for the data 
to be normally distributed. Using the r s , we find that rnRNA 
abundance is well, correlated with protein abundance (r, = 
0.74), and the CA1 is also well correlated with protein abun- 
dance (r s - 0,80) (and also with rnRNA abundance [data not 
shown]). For the data of Gygi et a J. (10), we obtained similar 
results, though with their data the correlation is not as good; r 5 
- 0.59 for the rnRNA- to- protein correlation, and r s - 0.59 for 
the codon bias-to-prbtein correlation. 

Jn a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Box- Cox and logarithmic, were used; both gave 
good correlations with our data [e.g., r p = 0.76 for I bg( adjusted 
RNA) to log(protein)]. Wc were not able to transform the data 
of Gygi et al. to a normal distribution. 

Finally, there are also some differences in data between the 
two studies. These may be partly due to ihe different measure- 
ment techniques used: Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
r imaging of in I act gels coupled to image an a lysis. We com- 
pared our data to tbeiis for the pTOieins common between the 
studies (but excluding proteins whose mUNAs are known to 
differ between rich and minimal media, and excluding Till, 
which was anomalous in differing by 100- fold between the two 
data sets). The r 3 between the two protein data sets was 0.88 
{P < 0.0001). Although this is a strong con elation, the fact that 



it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After 
normalizing the two data sets to assume the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data of Gygi et al. to be slightly higher than ours for 
the highest- abundance proteins and also for the lowest- abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. Although we do. not 
know what the errors are, we suggest the following as a rea-v 
sonable speculation. For the highest- abundance proteins, we 
may have underestimated the amount of protein because of a 
slightly nonlinear response of the pbosphorimager screens. For 
the lowest-abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately cutting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with rnRNA abundance. We calculate 
that the r s is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteins, a statistically significant dif- 
ference. However, with our data set, the r s is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for Ihe bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi et al.). Thus, our data set maintains a good 
correlation between rnRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by phosphorim aging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fact that we used both SAGE data and RNA 
hybridization data, which is most helpful for the least abundant 
mRNAs. Jn summary, we feel that the poor correlation of 
protein to rnRNA for the nonabundant proteins of Gygi et al. 
may reflect difficulty in accurately measuring these nonabun- 
dant proteins and mRNAs. rather than indicating a truly poor 
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because Ihe accuracy of measurement would 
be worse. 

How well C3n rnRNA abundance predict protein abun- 
dance? With r p ~ 0.76 for logarithmically transformed rnRNA 
and protein data, ihe coefficient of determination, {r p ) 2 , is 0.58. 
This means that more than half (in log space) of the variation 
in protein abundance is explained by variation in rnRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), 3nd (r p ) 7 = 
0.58 for the log data means that of this 200-fold variation, 
about 20-fold is explained by variation in the abundance of 
rnRNA and about 10-fold is unexplained (but could be due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between rnRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 

Some important conclusions can be drawn from this sam- 
pling of the proteome. First, there is an enormous range of 
protein abundance, from nearly 2,000 : 000 molecules per cell 
for some glycolytic enzymes to about 100 per cell for some cell 
cycle proteins l?6a). Second, about half of all cellular protein 
is found in fewer than J 00 different gene products, which are 
mostly involved in carbohydrate metabolism or protein synthc- 
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CycIooxygenase-2 expression in macrophages: modulation by protein 
kinase C-alplia. 

Giroux M , Descotcaux A 

Cyclooxygenase-2 (COX-2) is an inducible enzyme responsible for high levels of PG 
production dunng inflammation and immune responses. Previous studies with 
pharmacological inhibitors suggested a role for protein kinase C (PKC) in PG production 
possibly by regulating COX-2 expression. In this study, we addressed the role of PKC- 
aipha in demodulation of COX-2 expression and PGE2 synthesis by the overexpressing 

R AW°^!iTu? egatiVe - (DN) mUtaiU ° f tMs is o™y™* » <*« mouse macrophage cell line 
rSr 'nvesoffited me effect of various stimuli on COX-2 expression, namely 

• j ^l^™ 3 ' *"* the intrace 'l"'ar parasite Leishmania donovani. Whereas LPS- 
induced COX-2 raRNA and protein expression were down-regulated in DN PKC-alpha- 
overexpress.ng clones, IFN-gamma-induced COX-2 expression was up-regulated in DN 
r K.C-alpha-o verexpressing clones with respect to normal RAW 264 7 cells 
Measurements of PGE2 levels revealed a strong correlation between PGE2 secretion and 
IFN-garntna-mduced COX-2 mRNA and protein levels in DN PKC-alpha-overexpressing 
clones. Taken ^together, these results suggest a role for PKC-alpha in the modulation of 
LPS-^and [FN-gamma-induced COX-2 expression, as well as in IFN-gamma-induced - 
rub2 secretion. 
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Transcript profiling of human platelets usiug microarray and serial 
analysis of gcuc expression. 

Gnafenko DV , DuimJJ, McCorkleSR Weissmatin D . Perrotta PL. Bakou WF . 

Department of Medicine, Program in Genetics, State University of New York Stonv 
Brook 11794-8151, USA. ' 

Human platelets are anucleate blood cells that retain cytoplasmic mRNA and maintain 
functionally intact protein translational capabilities. We have adapted complementary 
techniques of microarray and serial analysis of gene expression (SAGE) for genetic 
profiling of highly purified human blood platelets. Microarray analysis using the 
Affymetrix HG-U95Av2 approximately 12 600-probe set maximally identified the 
expression of 2147 (range, 13%- 1 7%) platelet-expressed transcripts, with approximately 
22% collectively involved in metabolism and receptor/signaling,, and an 
overrepresentation of genes with unassigned function (32%). In contrast, a modified 
SAGE protocol using the Type IIS restriction enzyme Mmel (generating 2 1 -base pair 
[bp] or 22-bp lags) demonstrated that 89% of tags represented mitochondrial (mtj 
transenpts (enriched in 16S and 12S ribosomal RNAs), presumably related to persistent 
mt-transcnption in the absence of nuclear-derived transcripts. The frequency of non-mt 
SAGE tags paralleled average difference values (relative expression) for the most 
"abundant" transcripts as determined by microarray analysis, establishing the 
concordance of both techniques for platelet profiling Quantitative reverse transcription- 
polymerase chain reaction (PCR) confirmed the highest frequency of mt-derived 
transcripts, along with the mRNAsfor neurogranin (NGN, a protein kinase C substrate) 
and the complement lysis inhibitor clusterin among the top 5 most abundant transcripts. 
For confirmatory characterization, immunoblots and flow cytometric analyses were 
performed, establishing abundant cell-surface expression of clusterin and intracellular 
expression of NGN. These observations demonstrate a strong correlation between high 
transcript abundance and protein expression, and they establish the validity of transcript 
analysis as a tool for identifying novel platelet proteins that may regulate normal and 
pathologic platelet (and/or megakaryocyte) functions. 



PM1D: 12433680 [PubMed - indexed for MEDLINE] 



HEMOSTASIA THROMBOSIS, AND VASCULAR BIOLOGY 



Transcript profiling of human platelets using microarray and serial analysis 
of gene expression 

Dmitri V. Gnalenko, John J. Dunn. Sean R. McCorkle, David Weissmann, Peler L. Perrotta, and Wadie F. Bahou 



Human platelets are anucleate blood cells 
that retain cytoplasmic mRNA and main- 
tain functionally intact protein transla- 
tional capabilities. We have adapted 
complementary techniques of microarray 
and serial analysis of gene expression 
(SAGE) for genetic profiling of highly 
purified human blood platelets. Microar- 
ray analysis using the Affymetrix HG- 
U95Av2 approximately 12 600-probe set 
maximally identified the expression, of 
2147 (range, 13%-17%) platelet-expressed 
transcripts, with approximately 22% col- 
lectively, involved in metabolism and re- 
ceptor/signaling, and an overrepresenta- 
tion of genes with unassigned function 
(32%). In contrast, a modified SAGE proto- 
col using the Type IIS restriction enzyme 



Mme\ (generating 21-base pair IbpJ or 
22-bp tags) demonstrated that 89% of 
tags represented mitochondrial (mt) tran- 
scripts (enriched in 16S and 12S ribo- 
somal RNAs), presumably related to per- 
sistent mt- transcript ion in the absence of 
nuclear-derived transcripts. The fre- 
quency of non-mt SAGE tags paralleled 
average difference values (relative expres- 
sion) for the most "abundant" transcripts 
as determined by microarray analysis, 
establishing the concordance of both 
techniques for platelet profiling. Quantita- 
tive reverse transcription-potymerase 
chain reaction (PCR) confirmed the high- 
est frequency of mt-dcrived transcripts, 
along with the mRNAs for neurogranin 
(NGN, a protein kinase C substrate) and 



the complement lysis inhibitor cluster in 
among the top 5 most abundant tran- 
scripts. For confirmatory characterize- . 
tion, immunoblots and flow cytometric 
analyses were performed, establishing 
abundant cell-surface expression of clos- 
terin and intracellular expression of NGN. 
These observations demonstrate a strong 
correlation between high transcript abun- 
dance and protein expression, and they 
establish the validity of transcript analy- 
sis as a tool for identifying novel platelet 
proteins that may regulate normal and 
pathologic platelet (and/or megakaryo- 
cyte) functions. (Blood. 2003;101:2285-2293) 
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Introduction 



Human blood platelets play critical roles in normal hemostatic 
processes and pathologic conditions such as thrombosis, vascular 
remodeling, inflammation, ant! wound repair. Generated as cytoplas- 
mic buds from precursor bone marrow megakaryocytes, platelets 
are anucleate and lack nuclear DNA, although rhey retain 
megakaryocyte- derived niKNAs.'- 7 Platelets contain rough endo- 
plasmic reticulum and polyribosomes, and they retain the ability 
for protein biosynthesis from cytoplasmic mRNA. 3 Quiescent 
platelets generally display minimal translations! activity, although 
newly formed platelets such as ihose found in patients with 
immune thrombocytopenic purpura (lTP) synthesize various o> gran- 
ule and membrane glycoproteins (GPs). including GPJb and 
GPllb/llla (a tn $ y ). Furthermore, stimulation of quiescent platelets 
by agonists such as ©.-thrombin increases protein synthesis of 
various platelet proteins, including BcI-3 4 Like nucleated cells, the 
rapid translation of preexisting mRNAs may be regulated by 
integrin ligation to extracellular matrices ' In the case of platelets, 
the primary integrin involved in this process appears to be a ltb [3 3 
with cooperative signals mediated by the collagen receptor 



J nte grin-media ted platelet protein synthesis appears to be regulated 
at the level of translation initiation involving the eukaryotic 
initiation factor 4E (clHE). Jnstead of directly influencing eJF4E 
activity via posttranslational modifications (ie, phosphorylation), 
platelet eJF4E activity best correlates with its spatial redistribution 
to the mRNA-cnriched cytoskeleton. 8 Furthermore, because pro- 
tein translation is partially inhibited by the immunosuppressant 
rapamycin. it suggests that adhesion- and/or aggregation- induced 
outside- in- signaling function to regulate protein synthesis through 
the mTOR (mammalian target of rapamycin) pathway.** 8 - 9 

Despite the biologic importance of platelets and their intact 
protein synthetic capabilities, remarkably little is known about 
platelet mRNAs. Younger platelets contain larger amounts of 
mRNA with a greater capacity for protein synthesis, as determined 
by using fluorescent nucleic acid dyes such as thiazole orange.' 0 
This assay has been used as a quantitative determinant of younger 
or "reticulated" platelets (RPs). Indeed increased reticulated plate- 
lets are typically found in patients with conditions associated with 
rapid platelet turnover such ds FTP- typically RP percentages in 
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such patients approach J0% to 20% of all p!atelets > considerably 
higher than in healthy control subjects." Interestingly, high RPs 
have, been associated with enhanced thrombotic risk when identi- 
fied in patients with thrombocytosis,' 0 suggesting thai quantita- 
tively increased mRNA levels may be associated with the prothrom- 
botic phenotype. Whether this is related to globally altered gene 
expression profiles or to select changes more evident during 
situations of rapid platelet turnover remains unknown. Certainly, 
technical limitations of this assay limit its utility in defining 
prothrombotic genotypes, 10 " 12 and it cannot identify differentially 
expressed genes that may be causally implicated in disordered 
platelet phenotypes. 

Toward the goal of defining the molecular anatomy of the 
platelet genome, we have adapted complementary techniques of 
microarray and serial analysis of gene expression (SAGE) for 
genetic profiling of highly purified human blood platelets. Microar- 
ray technology represents a "closed" profiling strategy limited by 
ihe target genes imprinted onto gene chips. In contrast, SAGE is an 
"open" architectural system that can be used to identify novel genes 
and to quantify differentially expressed rnRNAs. 13 " 15 The sequence 
of each tag along with its positional location uniquely identifies the 
gene from which it is derived, and differentially expressed genes 
can be identified in a quantitative manner because the tag frequency 
reflects the mRNA level at the time of cellular harvest and analysis. 
By using boih technologies, we have identified a number of 
previously, uncharacterized genes that appear to be expressed in 
human platelets, while simultaneously establishing the dominant 
frequency of mitochondrial-expressed genomes comprising the 
platelet mRNA pool. These observations provide a panoramic 
overview of the platelet transcriptome, while additionally provid- 
ing insights into the molecular pathways regulating platelet.(and/or 
megakaryocyte) function in norma) and pathologic conditions. 



Materials and methods 

Reagents and supplies 

Thermits atftmiicus (Tnq) polymerase was purchased from (Roche, Indianap- 
olis. IN). T4 DNA ligasc was purchased Irony lnvirrogen (Carlsbad, CA), 
and restrict ion enzymes were from New England Biota bs (Beverly. MA), 
except for Mme\. which was obtained from the Center for Technology 
Transfer (Gdansk, Poland). All oligonucleotides were synthesized on an 
Applied Biosystems (Foster City, CA) 3-channel synthesizer and are listed 
in Table I. Monoclonal antibodies used for flow cytometric analysis 
included the F1TC (fluorescein isothiocyanate)-conjugafed anli-CD'H 
(o,| D p 3 ) immunoglobulin 01 (IgG); Irnmunotech, Miami. FL); phyco- 
erytbrin (PE)- conjugated antiglycophorin UgG2; Becton Dickinson Pharm- 
ingen. San Diego, CA): and peridinin chlorophyll protein (PERCP)- 
conjugated anti-CEM^ (IgOl: Becton Dickinson Phamiinecn). 

Platelet isolation, purification, and immunodetection 

All human subjects provided informed consent foi an 1RB (Institutional 
Review BoaidV-appiovcd pioiocol completed in conjunction with the 
General Clinical Research Center at Stony Brook University Hospital. 
Peripheral blood (20 mL) from healthy volunteers drawn into 2 mL of 4% 
sodium cituic t0.'I% vol'tol hnai concentration) was used to isolate 
erythrocytes by dincteniial ccnlrihrgaiion (1500^;) or ro isolate pine 
leukocytes by density gradient centrifugal ion as previously described. 
Platelets colJecied dom healrhy volunteers by apheresis were used within 
24 hours of collection A tier addition of 2 mM ED'IA (eihylencdiamineiet- 
raacetic acid), apheifsi^ -dt rived platelets from a single donor v.eie 
cenirihtged at l^iy for 15 minutes at 2i c C. To minimize leukocyte 
contamination, only the upper 9.-H) of the platelet- rich phj-ma fPRP) was 



used for gel filtration over a BioGel A50M column ( 1 000 mL total volume) 
equilibrated with HBMT (HEPES-buffered modified Tyrodes buffer. 10 
mM HEPES (Af-2-hyuVoxyeth'ylpiperazine-A^^2-ethat)csulfonic acid) pH 
7.4. 150 mM NaCI, 2.5 mM KG. 0.3 mM NaH 2 P0 4 , 12 mM NaHCOj, 
0.2% bovine serum albumen |BSA], 0.1% glucose, 2 mM EDTA). 
Gel-filtered platelets (GFPs) were subsequently filtered through a 5-tim 
nonwetting nylon filament filter (BioDesign, Carmel r NY) at 25°C and 
harvested by centri (ligation at I500g for 10 minutes at 25°C. Platelets were 
gentry and thoroughly resuspended in 10 mL HBMT buffer and incubated 
with 120 pX murine monoclonal anti-CD45 antibody conjugated to 
magnetic microbeads (Miltenyi Biotec, Bergisch Gladbach, Germany) on a 
rotating platform for 45 minutes at 25°C. Magnetic separation columns 
were used to capture CD45* cells (leukocyte fraction) by positive selection 
. (MACS II; Miltenyi Biolec). Purified platelets were concentrated by 
centrifugal ion at 1500g and immediately used for total RNA isolation. 

The efficiency of platelet purification was documented at each step by 
flow cytometry. 17 Briefly, aliquols containing 2 X I0 6 platelets were 
incubated with saturating concentrations of FITC- conjugated ariti-ClMI, 
PE-conjugated antiglycophorin, and PERCP-conjugatcd anli-CD45 for 15 
minutes in the dark at 25°C, washed -with phosphate- buffered saline (PBS), 
and fixed in PBS/1% formalin. Samples were analyzed using a FACScan 
(fluorescence-activated cell sorter scan) flow cytometer (Becton Dickinson) 
using CELLQuest software designed to quantify the number of CD45 + and 
glycophorin- positive events in the sample f expressed as the number of 
events per 100 000 CD41 + events). For some experiments, fixed platelets 
were permeabilized with 0. 1% Triton-X/PBS for 30 minutes at 25*C prior 
to the addition of primary antibodies, all as previously described.' 7 

Platelet protein detection was completed by sodium dodecyl sulfate 
(SDS)-polyacrylamide gel electrophoresis (PAGE) and immunoblot analy- 
sis as previously described, using the species- specific horseradish pcroxidase- 
conjugated secondary antibody and enhanced chemiluminescence," Anti- 
bodies included the anliclusterin monoclonal antibody (Quidel, Santa Clara, 
CA; 1:1000 primary and 1:10 000 secondary) and the antincurogranin 
rabbit polyclonal antibody (Chemicon International, Temecula, CA; 1:1000 
primary and 1:10 000 secondary). 

Molecular analyses and microarray profiling 

Purified, individual cell fractions were resuspended in 10 mLTrizol reagent 
(Invitrogen), transferred into die thylpyrocar bona tc (DE PC)- treated Corex 
(Springfield, MA) tubes, and serially purified and precipitated by using 
isopropanol essentially- as previously described. 16 Total cellular RNA was 
harvested by centrifugation at I2 500# for 20 minutes at 4°C, washed 2 
times with 75% ethanol (10 mL/hibe). and resuspended in 100 u-L 
DE PC- treated water. Platelet mRNA quantitation was performed by using 
fluorescence-based real-time PCR (polymerase chain reaction) technology 
(TaqMan Real-Time PCR; Applied Biosystems. Foster City. CA). Oligonu- 
cleotide primer pahs were generated by using Prime 1 3 software (www- 
genome. wi.mit.edu), designed to generate approximately 200-base pair 
(bp) PCR products at the same annealing temperature, and are outlined in 
Table !. Purified platelet mRNA (4 u-g) was used for first-strand cDNA 
synthesis using oligo(dT) and Superscript II reverse transcriptase (Invitro- 
gen). For teal- time reverse transcription |RT)-PCR analysis, the RT 
reaction was equally divided among primer pairs and used in a 40-cycIe 
PCR reaction for each target gene by using the following cycle: 94 °C for 30 
seconds, 55°C for 30 seconds, 72°C for t minute, and 7I°C for 10 seconds 
(40 cycles total). mRNA levels were quantified by monitoring real-lime 
fluorometric intensity of SYI3R green 1. Relative -mRNA abundance was 
determined from triplicate assays performed in parallel for each primer pair 
and was calculated by using the comparative threshold cycle number (A-O 
method) as previously described. xi 

Gene expression profiles were completed by using the approximately 
I? 600 probe set HG-U95Av2 gene chip t A ifymeti i.x. .Sam a Clara, CA). 
lota I cellular RNA (.*► pg) was used Ini cONA synthesis by using 
SuperScnpi Choice system (bile Technologies. Rock vi He. MD) and an 
olioo(dT) primer containing the "17 polymerase recognition sequence 
t Primer SI; Table 1), followed by t DNA purification using GFX spin 
columns. lr» vitro transcription was completed in the presence of biotinyl- 
ated ribonucleotides by using a BioAiray High Yield RNA Transcript 
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PLATELET TRANSCRIPT PROFILING 2287 



Table 1. Oligonucleotide primers 



Gene and primer 
direction 



Sequence (5' - 3') 



Nucleotide 
Position 



SI 


Qligo(dT) 


Cassette A 


SAGE 


Cassette B 


SAGE 


S2 


SAGE . 


S3. 


SAGE 


GP4 


Glycoprotein JIB |F) 


GPS 


Glycoprotein HB [R| 


GP6 


Glycoprotein IIIA JF) 


GP7 


Glycoprotein IIIA |R) 


PAR 18 


PARI |FJ 


PAR19 


PARI [RJ 


SR6 


16SrRNAfF] 


SR9 


1BS rRNA |R) 


NADH10 


NADH2rF) 


NADH11 


NADH2 (RJ 


THYM12 


Thymosin B4 [FJ 


7HYM13 


Thymosin p4 [RJ 


CLUS14 


Cluster in |F) 


CLUS15 


Clusterin |RJ . 


NRG 16 


Neurogranin (FJ 


NRG 17 


Neurogranin fRJ 




T-cell receptor 


7CR1B 


p-chain JFJ 




T-ceH receptor 


7CR19 


p-chain |R| 


CD4520 


CD45 |Fl 


CD4521 


CD45 (RJ 



5*- Bn-GGCCAGTGAATTGTAATACGACTCACTATJVGGGAGGCGG- {dl 
; y 

5 ' - TTTGGATTTGCTGGTCGAGTACAACTAGGCTTAATCCGA CATG - 3* 
3'- *CCTAAACGACCAGCTCATGTTGATCCGAATAAGGCTp- S' 
3 ' - pT TCATG GCGGAGACGTCCGCCACTAGTGTCG<lAACTGACTA * - 3' 
5 ' - WAAGT ACCGCCTCTGCAGG CGGTGATCAC AGCGTTGA CTGAT - 5* 
M . 

¥ - Bn - GGATTTGCTGGTCGAGTACA - 3* 

> ' - Bn - TAGTCAGGTGCGACACTAGTGGC- 3 " 

i ' - AGGG CrTTGAGAGA CTCATCTGTA - 3 ' 

ACAATCTTGCTGTTTGGATTCTG- 3' 
i ; - TATAAAGAGGCCACGTCTACrTTC - 3 ' 
»'- CACTTCCACATACTGACATTCTCC- 3 ' 
/-AATGTCAGTTCTGATATGGAAGCA- 3' 
i' - CCCAAATGTTCAAAC UC 1 1 iA GC- 3 ' 
. ' - TGCAAAGGTAGCATAATCACTTGT - 3 ' 
'-GTTTAGGACCTGTGGGTTTGTTAG- 3' 
* - CTAGCCCCCATCTC AAATCATATAC - 3 * 
' - AATGGTTATGTTAGGGTTGTACGG - 3 * 
; - AAGACAGAGACGCAAGAGAAAAAT- 3 ' 
' - GCAGCACAGTCATTTAAACTTGAT- 3 ' 
' - CCAACAGAATTCATACGAGAAGG- 3 ' 
'- CGTTATATTTCCTGGTCAACCTCT- 3 * 
" - GCCCmTAGTTAGTTCTGCAGTC- 3 ' 
' - TTTTCTTT AAGTGAGTGTGCTTGG - 3 ' 



5 - CCACAACTATGTTTTGGTATCGT- 3 ' 

5 ' - CT AG CA CTGCAG ATGTAG AA GCT - 3 ' 
5 * - GCTCAGAATGGACAAGTA - 3 * 
5 ' - CACACCCATACACACATACA - 3 ' 



2094-2117 
. 2301-2279 
2335-2358 
2532-2509 

2585- 2608 
2776-2753 

2586- 2609 
2765-2762 
4875^t898 
5075-5052 

135-158 
336-313 
1006-1028 
1222 1 199 
135 M 374 
1567-1544 



332-310 
3771-3788 
4280-4261 



(FJ indicates forward (sense) strand; (RJ. reverse (antisense) strand; Bn, biotin; p. a phospborylaled 5' end (cassettes A and B); underlining. MtoHl'siles in cassettes A and B; 
a/rows, corresponding sequence for S2 and S3 wiibin cassettes A and B. respectively; bold, the Wmel site; and N. A. C. 7, or G. nucJeolide position based on the following 
accession numbers: glycoprotein IIB (J02764), glycoprotein IIIA (M35999). PARI (M62424). 16S rRNA and NADH2 (NC_ 001807). thymosin 04 (M17733). clusterin (M25915). 
neuiooranin (X99076). TCR p-chain (AF043 182), CD45 (Y0063B). 

* Indicates an arnino-rnodified 3' end in both cassettes: — . not applicable. 



Labeling Kit (Enzo Diagnostics, Fnmiirigdale. NY), and, after metal- 
induccit fragmentation, 15 ng biorinyiated cRNA was hybridized to the 
JlG-U95Av2 oligonucleotide probe array for ]6 hours at 45"C. After 
washing, the cRNA was detected with streptavidin-phycoerythrin (Molecu- 
lar Probes, Eugene, OR) and analysis was completed by using a Hewlett- 
Packard Gene Array Scanner {Afiymcirix). The average difference value 
(AD) for each probe set was quantified using MAS 4.01 software 
|AiTymciri>), calculated as an average of fluorescence differences for 
perfectly matched versus single-nucleoiide mismatched 25-mer oligonucle- 
otides (16 to 20 oligonucleotide pairs per probe set). The software is 
designed to exclude "posilivc calls" in the presence of high average 
differences with associated high mismatch intensities. 

SAGE profiles 

Plate U- 1 SAGF. libraries were generated essentially as previously de- 
scribed. 1 ' modi lied as outlined in figure I for the use of Mme] as the 
lagging enzyme.'* This type US restriction enzy me cleaves ?Q of 1 8 bp past 
lis norma) indiomic (TCCRAC) recognition sequence thereby generating 
longer rags I? I- or ??-mer) than those obtained using Bsmfl as the standard 
t:iggm» cn?ynic (13- M bp tags). These longer \ime\- generated raps 
potentially provide for more definitive "iai;-to-uene " identification and are 
particularly useful in characterizing expression patterns in the absence of 
complete genomic sequence data (comprehensive me j hods derailed in Dunn 
ct a! 1 *). Briefly, polyt A) mRNA was isolated from II) ( ig roral platelet RNA 
mine the oligo-(dT) Si primer conjugated to magnetic beads (Dynal 
Biotech. Lake Success. NY), followed by cDNA synthesis using Super- 
script II fevcise transcriptase ilrtvirrogen). I he cDNA was ihen digested 



with the restriction enzyme NiaU) (anchoring enzyme), ligated to cassette A 
using T4 DNA ligase, and, aflcr the beads were extensively washed, the 
cDNA was digested with Mme) to release the lags from the beads. After 
purification, tags were ligated to degenerate cassette B lingers (specifically 




t£g|sj [ © """" 2' ' ***** '* " j 



I- 



—TTTTT- TOCrCI 



Figuip 1. Schema outlining the moiriftpd SAGE protocol used in platelet 
anatyics. The hn;»l tags r.te flanked r>y the NtoUl |.»nr.hon»g en?ym<?) CA1G 
sentience, thereby providing km to- gene idenijftcation when exported to a relationnJ 
dalab;»se (refer to "FJioinromtatic analyses' and lohle t for details). 
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designed to anneal lo ihe nonuniform Mmel overhangs), and PCR-amplified 
using biotinylated primers S2 and S3 for 30 cycles (95°C for 30 seconds; 
. 58°C for 30 seconds; 72°C for 30 seconds) using Platinum Taq DNA 
polymerase (Gibco BRL). A fraction (20%) of the pooled PCR products 
were then subjected lo one round of linear amplification using primer pair 
S2/S3, followed by a second round of 25 amplifications using primer S2 
alone (95°C for 30 seconds, 58°C for 30 seconds, 72°C for 30 seconds). 
Primer S3 was subsequently added for one cycle (95°C for 2.5 minutes, 
58°C for 30 seconds, 72°C for 5 minutes); the latter steps were collectively 
adapted to exclude hcteroduplcx formation." 1 Unincorporated primers were 
removed by incubation with 200 U Escherichia colt exonuclease 1 for 60. 
minutes at 37 < "C. PCR products were then pooled and digested with AVoJH 
to release tags, and biotinylated linker amis were cleared using strept avid in- 
coated immunoaffinity magnetic beads (Dynal Biotech). Tags were con- 
catamerized using 5 U/pX T4 DNA ligase, and products more than 100 bp 
were isolated by size- fractionation in low-melting agarose gels. The DNA 
was purified by GFX spin columns, and the conca tamers were cloned into 
the Sph\ site of pZero (Invitrogen). After transformation into Ecoli TOPI 0 
cells, recombinant clones were isolated and sequenced in 96-well microliter 
plates using an ABI 377 sequencer and ABI Prism BigDye terminator 
chemistry (Perkin- Elmer Applied Biosyslems. Branchburg, NX). . 

Bioinformatic analyses 

Functional grouping of genes determined lo be present by Affymetrix MAS 
4.01 software was performed using a dChip program linked to the National 
Center for Biotechnology LocusLink, which is an annotated reference 
database for genes and their postulated functions. 70 Of the approximately 
12 600-probe sets represented on the Affymelrix HG-U95Av2 Gene chip, 
Junctional annotations exist for approximately 8100 with the remainder 
categorized as unknown. Microarray data were visualized and analyzed 
using BRB- Array Too Is software (Version 2.1), kindly developed and 
provided by Dr Richard Simon and Amy Peng (Iinus.nci.nih.gov/BRB- 
ArrayTools.html). A logarithmic (base 2) transformation was applied lo the 
average difference values for individual data sets for determination of 
microarray conconlancies. Discordancy was defined as a 2- log difference in 
the maximum log intensities between individual experiments. 

SAGE lags were extracted by using in-house SAGE software uniquely 
modified to identify Afmel tags. Ihe software ensures that only unambigu- 
ous 2t- lo 22-bp tag sequences ;ire extracted for transcript profiling. Tags 
with ambiguities (Ns), lengths other than 21 or 22 bp, or with ambiguous 
orientations were extracted to separate files for manual editing or further 
examination. Finalized data were exported to a relational database for tag 
quantification and genetic identification/ 0 



Results 

Platelet purification 

To ensure that the RNA profiles accurately represented those of 
circulating blood platelets, a number of complementary methods 
were implemented to remove con lain inn ting nucleated leukocytes. 
Purification methods incorporating gel filtration, a 5-u.m leukocyte 
reduction filter, and magnetic CD45 immunodepletion allowed for 
the cumulative enrichment of highly purified platelets. The efficacy 
of this purification method was initially established by using 
peripheral blood platelet. rich plasma as the starting material. The 
final product contained no more than 3 lo 5 leukocytes per IX lfp 
platelets as deter mined by parallel flow cytometric analysis, 
represent ins. an appto\im;He 4 M)- fold i eduction of nucleated 
leukocytes. These results eon elated well with molecular evidence 
for leukocyte depletion as dele i mined by RT-PCR using both CD 15 
and T-celi receptor ("i- chain (TCKfJ) primers (see Figure 2). 
Decause the total RNA yield horn peripheral blood platelets was 
insufficient for mieioanay studies. w C adapted the protocol to 
platelet npberesis dono;s with nearly identical final purity (Figure 



A B 




Cyd* Mjrribof 

PRP Platelets 



Figure 2. Determination of platelet purity. (A) Total cellular RNA (1.8 pa) from 
platelet-rich plasma (PRP) or purified platelets from a single apheresfs donor were 
analyzed by RT-PCR (35 cycles) using oligonucleotide prfmers specific tor glycopro- 
tein lib (GPtlb). T-ceB receptor p-chain (TCRp), or CD45; t0>l ol the 50 reactions 
were analyzed by elhtdium-stamed agarose get electrophoresis. Minimal to no TCRp 
gene product was visually evident only in PRP. Size markers corresponding to 
Haeltt-restricted 40(174 DNA are shown. (B) Real-time RT-PCR was completed by 
using 1.8 jig total RNA and TCR0-specific oligonucleotide primers optimized lor 
quantitative analysis by real-time PCR.* 8 On the basis of parallel determinations 
using RNA isolated from known amounts of purified leukocyte standards, the 
leukocyte-depiction protocol represents an approximate 2.5-tog purification from the 
starting PRP. Results are representative ol one complete set of experiments repeated 
on 2 separate occasions, and data points represent the mean from triplicate wells, 
with standard errors of the mean (SEM) less than 1 % (not shown). 

2). The platelet recovery was nearly 65% of the starting material, 
yielding approximately 2.3 X 10" platelets from an initial aphere- 
sis pack containing approximately 3.6 X 10" platelets. The bulk of 
the losses occurred during the initial centrifugal ion and filtration 
steps. The purification protocol was less effective at removing 
erythrocytes, although there were less than 50glycophorin-positive 
cells per J X 10* platelets after the final purification step. Nonethe- 
less, these cells represent unlikely sources for contaminating 
cellular RNA (see "Cellular microarray analysis" below). 

Cellular microarray analysis 

The purified platelet RNA was sufficient for microarray studies and 
was used for cRNA generation and hybridization to the Affymetrix 
I )G-U95Av2 GeneChip. The anatomic profile of platelet RN As 
from 3 healthy male donors was determined by using Asymetrix 
software. Of the J2 599 probe sets imprinted onto the chip, a 
maximum of 2147 (17%) transcripts were computationally identi- 
fied as "present" by Ihe Affymetrix software, 152 (1.2%) were 
equivocal, and nearly 82% were absent. As a fraction of the total 
genes present on the chip, Ihe percentage of platelet- expressed 
genes (15%- 17%) was generally lower lhan that obtained from 
other human cell types in which 30% to 50% of genes are present as > 
determined by Affymetrix software (J. Schwedes, personal commu- 
nication. May 2002). The "limited number" of plalclct-expressed 
transcripts presumably reflects the lack of ongoing gene transcrip- 
tion in the anuclcate platelet. Because less than 1% of circulating 
red blood cells contain residual RNA, it is unlikely that any of these 
transcripts are erythrocyte derived, although this was formally 
addressed by isolating total cellular RNA from 20 ntL of whole 
blood (corresponding to an -3-log fold excess of erythrocytes than 
thai identified in our final sample). The total ceiluhn yield of RNA 
front this starting material was approximately 250 ng r suggesting 
that less than I ne erythrocyte- derived RNA was present in the 
purified platelet preparations. Despite this, however, both o- and 
P-globin transcripts— a lorn.' with both the fcrriiin heavy and light 
chains — were identified as abundant transcripts (Table 2\ Al- 
lhough the most parsimonious explanation would be residual contami- 
nating reticulocytes, this is not supported by our erythrocyte contami- 
nation estimates, and their significance remains unresolved. 

As a means of better dissecting the molecular anatomy of ihe 
plate lei. expressed, genes were grouped on the basis of assigned 
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7 able 2. Top 50 human platelet-expressed 9enes 



Accession no. 


Gene symbol 


AD values, range* 


Gene transcript 


Leukocyte expression* 


M17733 


TMSB4X 


140142-307 852 


Thymosin 84 mRNA. complete cds 


+ 


X99076 


NRGN 


101 510-148279 


Neurogranin gene 


+ 


M25079 


HBB 


40 839-229 556 


0-gtobtn mRNA, complete cds 


+ 


M25915 


CLU 


84 720-140 246 


Complement cylotysrs inhibit of (ctusler'in) comptole cds 




J04755 


FTHP1 


B2 980-148 621 


Fern I'm H processed pseudogene, complete cds 


- 


D78361 


OAZ1 


73 098-118 140 


mRNA for ornithine decarboxylase anlizyme 




X04409 


GNAS 


77 761-94 781 


mRNA for coupling protein G(s) o-subunit (alpha-Si) 


- 


M25897 


PF4 


62 81 M 26 908 


Platelet factor 4 mRNA, complete cds 




AB021238 


B2M 


61689-108 021 


^-microglobulin 




X00351 


ACTB 


25 143-73 775 


mRNA for B-aclin 


- 


D21261 


TAGLN2 


76 687-101 931 


mRNA for KIAA0120 gene . 




At 03 1670 


FTLL1 


69 865-99 966 


Ferritin, light polypeptide 1 


• ■ + 




' GPDB 


41 404-110 32B 


Platelet glycoprotein tbp chain mRNA 


_ . 


M21121 


CCL5 


47-308-106 399 


T-ce 5- specific protein (RANTES) mRNA, complete cds 








ill 11R.QK ft? R 


Llnsp5ced mRNA for glutathione per oxidase 




J00153 


HBA1 




Alpha gtobin gene cluster on chromosome 16 






MYL6 


46 337-106 833 


Nofixnuscle/srnooth muscle alkali myosin light chain gene 


•* 


L20941 


FTHt 




Ferritin heavy cfiam mRNA, complete cds 




J03040 


SPARC 


Ci -tee 1A 1C1 


SPARC/oslconeclin mRNA, complete cds 




X56009 


GNAS 


45 543-72 096 


GSA mRNA for q subunil of GsGTP binding protein 




X58536 


HLA 


31 183-82 613 


mRNA tor major HLA class 1 locus C heavy chain 


+ 


M54995 


PPBP 


46 571-67 t69 


Connective tissue activation peptide III mRNA 




U34995 


GAPD 


35 095-70 250 


Normal keratinocyte substraction library mRNA, clone H22a 




140399 


MLM3 


32 107-73 364 


Clone zap112 (muiL protein homolog 3) mRNA 




X77548 


NCOA4 


31 452-61 036 


cDNA tor RFG {RETprolo-oncogene RET/PTC3) 




. U90551 


H2AFL 


35 086-51 892 


Histone 2A-like protein (H2A/I) mRNA 




Ml 1353 


H3F3A 


31 614-55 813 


H3.3 histone class C mRNA 




Z 12962 


RPL41 


36 003-54 853 


rnRNA for homologue to yeasl ribosomal protein L4 1 




X06956 


TUBA1 


£\J 98B-D1 / ifS 


HALKrtA 44 gene TOT a-iuDtuin 




A rj n "> cj Ci r. r\ 




">A K.~7\ K11 
/** 3( l-iKJ oil 


mfJKJA fm IflAA 1fV>7 nrrAB.m 




Y 1271 1 




SJ bqU-4 J 1/4 


niRNA for putattvp progesterone binding protein 




M16279 


MIC2 


30 894-40 16b 


Integrated membrane protein (MIC2) mRNA 






YWMAM 




Drain 14-3-3 protein fj-chain 




Af-0/UDOD 


TOP3B 


/U u//-b/ 945 


Clone 24675, unknown cDNA 




AAS2480? 


UnKnown 


23 846-39 481 


LUNA, IMAt^fc ClOOe W/U 




AB009010 


UBC 


28 745-38 389 


mRNA for potyubiQoilin UbC 


+ 


X57985 


H2AFO 


2 » 6/8-52 108 


Genes for histones 1 t2B. 1 and H2A 




X54304 


MLCB 


25 733-34 109 


mRNA for myosin regutatory fight chain 




M 14539 


F13A1 


23 691-48 474 


Factor XIJI subunit n -polypeptide mRNA, 3' end 




AI540958 


Unknown 


24 872-41 118 


cDNA, PEC 1.2_15_HOI.r5* end/clon 




AL050396 


FLNA 


13 634-55 235 


cDNA DKFZp 586K1720 




X56841 


HLA-E 


12 890-49 327 


Nonctossical MHC class 1 antigen gene 




M26252 


PKM2 


15 450^17 786 


7CB (cylosolic thyroid hormone-binding protein) 




M 14630 


P7MA 


19 314-45 088- 


Prolhymosin alpha mRNA 




AF 046229 


RGS10 


19 156-34 243 


Regulator of G protein signaling 10 mRNA 




AA477898 


Unknown 


16 863-44 756 


cDNA. Z&34t08.ri 5' end 




X95404 


f LI 


15 216-37 456 


mRNA tor nonmuscle type cofttin 




M34460 


n GA2B 


8 627-45 495 


Ptaletet glycoprotein Ho (GPHb) mRNA 




283738 


M?Bre 


T8 001-31 306 


HH?B/e gene 




L 19779 


H2AFO 


17 319-38 951 


Histone H2A.2 mRNA. complete cds 





'Gerie c^pre.ssinn nuaniiffcations were calculated as ihe overage drtference (AD) value {maiched versus mismatched oligonucleotides) tor each piobe set using Artymetitx 
GeneChip solrwaie. vets ion 4.01. The range of values fiom 3 distincl plate lei mic roan ays is shov/n; the normalization value to» all microanay analyses was 250. 

pianstripfs are i:*nk- ordered [hiohesl to lowest) using BRB- Array loots software by log- intensities of AD values obtained from 3 different healthy donors; 33 of the top 40 
Transcripts were tisied among the top 50 in all 3 mictoanay sets. 

J Leucocyte e> prcssion was Uelennined by mitioarray analysis using purified peripheral blood leukocytes, lollowed by construction ot rank- intensity plots lc> comparison to 
platelet top SO tinnscripls. ro 1op leukocyte-derived Iransciipts identified within the ranked top 50 platelet transcripts are depicted by a ( ■* ) piesent. or (- ) ot>se»l. 

cds indicates cod>ng sequence 



^rnc rmnoktlions. rind thi^ onn lysis w;ts used to provide a prm- 
OTitmic tlcliniiion ol the p);itek j i transci)pionu:.'Of ihe »em's lltal 
con id be citnloijed within nsstgned "clusters." those involved in 
mei.nholism {))%) nnd lecepior/signnlinu (11%) icpiescnicd the 
In i {jest (jronps. Also evident in these analyses is the relatively laiee 
percentage ot tjenes involved in functions unrelated to these key 
groups tie. nii.scellaneons. 25%). and the ovcrrepresejitation of 
penes ^i'h unknown funeiion (37%) as annotated by AlTymctm 



and RelSeq databases.- 1 These icsnlts identify a vast array (nearly 
one half) of platelet genes (and penc products) that presumably 
have important, but poorly characterized functions, in platelet 
and'ot megakaryocyte biolouy. 

Although microarray analysis is not truly quantitative, rank- 
oidering using ihe mean log- intensities from 3 independent rTiicroar- 
r:iy analyses allowed for the categorical ton of fhe top platelet 
transcripts (Table 7). Computational analyses demonstrated that 
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only 10 of the lop 100 genes were discordant among the 3 platelet 
microarrays, although 71 of 100 genes were discordant between 
platelet and leukocyte arrays. An inventory of me top 50 platelet 
genes is listed in Table 2, which also delineates those found to be 
highly expressed in peripheral blood leukocytes by parallel microar- 
ray experiments with this purified cellular fraction (data not 
shown). Further analysis of these cell subsets demonstrated that 
approximately 25% (n = 547) of the total platelet transcripts were 
platelet restricted. Furthermore, only 10 of the 50 most highly 
expressed genes were found to overlap, confirming the distinct 
cellular profiles of each iranscriptome. Of the 12 overlap genes, 3 
corresponded to globin or ferritin chains (again suggesting the 
presence of contaminating reticulocytes in both purified fractions), 
and another 4 were involved in actin cyloskeletal reorganization 
and human leukocyte antigen (HLA) expression, gene products that 
regulate critical functions in both cell types. Given the importance 
of cytoskeletal reorganization in downstream platelet activation 
events, it is not unexpected that components of the actin machinery 
system would demonstrate prominent transcript expression. Previ- 
ous estimates suggest that 20% to 30% of the total platelet 
proteome is comprised of actin with other components such as 
actin- binding protein mysosin, and talin accounting for an addi- 
tional 2% to 5% of the total protein. 1 - 22 The mRNAs encoding the 
actin-related machinery are overrepresented in. our micrparTay 
analysis, with 8 such transcripts found among the 50 highest 
platelet- expressed genes. Interestingly thymosin pM demonstrated 
the highest expression pattern. In unstimulated platelets, 30% to 
40% of actin is polymerized as F- actin," whereas the balance of 
actin monomers (G- act in) are polymerization inhibited by seques- 
tering proteins such as profilin (100 uM) and thymosin 04 (600 
pM). 73 The high thymosin 04 transcript expression not only 
correlates with its known abundance in platelets but also supports 
the importance of actin inhibitory proteins in maintaining the 
nonstimulated state of circulating platelets. 
Platelet SAGE analyses 

Although these initial studies identified the distribution and relative 
expression patterns, of the genes within the Asymetrix data set. 
they do not allow for analyses of genes thai arc unrepresented by 
these oligonucleotide chips. Unlike closed microarray profiling 
strategies, SAGE is an open architectural system that is ideally 
stilted for novel gene and pathway identification. Accordingly, the 
platelet RNA used for microarray studies was used for platelet 
SAGE A total of 2033 lags were initially cataloged, of which J 800 
(89%) corresponded to mitochondrial-derived genes. These results 
were quite different from those obtained by microarray analyses, 
but the discrepancy can be resolved by the nonrcpresemation of the 
mitochondrial genome on the gene chip. The mitochondrial ge- 
nome is a compact approximately J6.6-kilobase (kb) sequence 
encoding 13 genes and 2 ribosomal subuoits. 74 Primary mttochon 
drinl transcripts arc polycistronic and typically contain premature 
ic mi i nation or unpredictable splice si lev. resulting in multiple 
polyadenylated transcripts from individual genes. ^-^ Indeed, the 
overall distribution of platelet- derived mitochondrial SAGE tags is 
quite .similar to that found in muscle. 7 - All 13 genes containing 
NUiUl sites were detected, whereas neither _ of the non-A'/Vdll- 
contaimng genes were identified t nicotinamide adenine dinuclco- 
tidc | NADU J dehydrogenase summit -11. and adenosine triphos- 
phatase (ATPase) S). Most of the tags were from the I6S and I2S 
ribosomal RNAs— which collectively accounted for 68% of the 
total mitochondrial lags— with the fewest tags represented by 
NADIi dehydrogenase ?u bun its .1.5. (\ and cytochrome c oxidase I 



AT*3AC* 
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Figure 3. Schema of the mitochondrial genome with SAGE tag distributions 
(only tags with identical matches are displayed). The abundance of the SAGE 
tags [n ~ 1800) at individual Ntelll sites (arrows) within the mitochondrial fteavy 
strand is shown on the bottom, whereas those tags corresponding lo the mitochon- 
drial light strand are delineated above the arrows (the presence of an unaccompanied 
arrow implies no SAGE tags at that Mat II site). The gene products of mt-DNA 
(RefSeq accession no. NC_001807) are delineated by the open rectangles, whereas 
stippted boxos represent tRNA genes and control regions (the single tag represented 
by the (') refers lo rnitochondriat transfer RNA-serine). Note that MAOH6 is encoded 
by the fight strand and that there are no Affattt sites within the ATPaseB gene segment. 
COJnJ, cytochrome c oxidase suburwt; Cyt b. cytochrome b. 

(Figure 3). The NADU dehydrogenase subunit 6 RNA is the only 
mRNA encoded by the light (L) strand of mitochondria) DNA and 
was the least abundantly detected transcript. 

The unusually high preponderance of mitochondrial-derived 
genes is not inconsistent with the known enrichment of these 
genomes in human platelets,'- 34 and presumably reflects persistent 
transcription from the mitochondrial (ml) genome in the absence of 
nuclear-derived transcripts. This overrepresentation of mlDNA in 
platelets is considerably greater than that of its closest cell type 
(skeletal muscle), in which mt genomes represent approximately 
20% to 25% of all SAGE tags. 25 Interestingly, the energy metabo- 
lism of platelets is not dissimilar from that of skeletal muscle, both 
cell types actively using glycolysis and large amounts of glycogen 
for ATP generation. 76 Like muscle, platelets are metabolically 
adapted to rapidly, expend large amounts of energy requited for 
aggregation, granule release, and clot retraction. Similar lo the 
situation in all eukaryotic cells, platelet mitochondria represent the 
primary source of ATP, which is generated from oxidative phosphor- 
ylation reactions occurring within these organelles. Mitochondria 
are also responsible for most of the toxic reactive oxygen species 
generated as by-products of oxidative phosphorylation and are 
central regulators of the apoplotic process in other cellular types. 
The mtDNA encodes polypeptides found within 4 of the 5 
multifunctional complexes that regulate oxidative phosphorylation 
within the platelet mitochondria. 27 Whether the continued genera- 
tion of these polypeptides has a role in platelet energy metabolism 
and/or the apoptotic mechanisms regulating platelet survival re- 
mains speculative, although not inconsistent with our observations. 



Comparative analysis of SAGE and microarray 
transcript abundance 

Complete SAGE libraries require the sequencing of up to 30 000 
tags for an exhaustive cataloging of individual mRNAs, especially 
those with limited copy numbers.' 3 ^ Given the preponderance of 
ml- derived transcripts, comparable sampling would have required 
sequence analysis of nearly 300 000 SACK tags, an inordinate 
number for comprehensive analysis of the platelet iranscriptome. 
l-oi platelets, alternative methodologies incorporating subtr active 
SAG!: will be required for more comprehensive transcript profil- 
ing.' 1 ' Our initial sampling of n on mitochondria I genes remains 
informative, however, and entirely consistent with. the results of 
platelet microarray studies. As shown in Table 3, SAG!; tags for the 
genes encoding thymosin p>1, f*?- microglobulin, neurogranin. and 
i be platelet glycoprotein Ibp polypeptide were among the most 
frequently id en ti lied platelet genes, similar to the rank -ordered 
results determined by mi cr nan ay analysis. To formally confirm the 
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Table 3. SAGE-identified nonmitochondnal tags 



Frequency 


CATG + SAGE lags* 


Accession no. t 


Gene 


Microarrayt 


26 


G T T G7GGTT AATC T GGT 


NM_ 004048.1 


p2-mtcroglobuJir>iB2M), mRNA 


PPP 


21 


TTGGTGAAGGAAGAAGT 


NM_021 109.1 


Thymosin p4 ; X chromosome (T MSB4X). mRNA 


P 


B 


AGC1CCGCAGCCAGGTC 


NM_002620.t 


Platelet factor 4 variant 1 (PF4VI); mRNA - 


V 


8 


AGCTCCGCAGCCGGGTT 


NM_002619.1 


Platetel factor 4 (PF4) f mRNA 


P 


7 


T GTATAAAGACAACCTC 


NM_ 002 704.1 


Proplaretet baste protein (B-thromboglobolin) 


Pp 


5 


GGGCACAATGCGGTCCA 


NM_000407.1 


Glycoprotein Ib0 polypeptide, mRNA 


P 


3 


AGGTAATAAAAGGTAAT 


NM_003512-1 


H2A htstone family, member L (H2AFL). mRNA 


P 


3 


AGTGGCAAGTAAATGGC 


NM_021914.2 


Cofilin 2 (muscle) (CFL2). mRNA 


N/A . 


.3 


T GACTGTGCTGGGTTGG 


NM_006176.t 


Neurogranin (protein kinase C substrate. RC3) mRNA 


P 


3 


TTGGGGTTTCCTTTACC 


NM_ 002032.1 


. Ferritin, heavy polypeptide 1 (FTH1), mRNA 


P 


2 


CCCTTGTGACTACCTAT 


NM_02515B.1 


Hypolheticar protein FU22251 (FLJ22251). mRNA 


NfA 


2 


CCT GTAACCCCAGCTAC 


NM_032779.T 


Hypothetical protein FLJ14397 (FLJ14397). mRNA 


N/A 


2 


CTTGTAGTCCCAGCTAC 


NM_01796Z1 


Hypothetical protein FLJ2082S (FLJ20825), mRNA 


N/A 



•Unique tags identified more than once. . 

1 Refers to the RefSeq accession no. 71 Note that this number does not necessarily correspond to the accession no. provided by Affymelrix software annotations (Table 1). 

JPresence (P) or absence (A) is based oh results from 3 distinct platelet microanay experiments. Capitalized "P" designates a gene that is in the top 50 on att 3 microarray 
experiments, whereas small *p" designates those transcripts not.in the top 50. Two of the genes (p2-microgk>buBn and f^ttvc^bog^birfin).are represented by 3 and 2 probe 
sets, respectively, on Ihe HG-U95Av2 gene chip; for p2-M. all 3 probe sets were in the top 50 genes, whereas for thymosin B4 1 of 2 was in Ihe top 50 for aD experiments (the 
other probe set was in the top 75 tor all experiments). N/A indicates oligonucleotide not present onAtfymetrix HG-U95Av2 gene chip. 



results independently obtained by SAGE and microarray analysis, 
quantitative RT-PCR was completed by using oligonucleotide 
pTimers specific for 2 abundant mitochondrial transcripts, 16S 
rRNA and NADH2 thymosin p4 (high- abundance by microarray 
and SAGE), 2 incompletely characterized high-abundance tran- 
scripts (neurogranin and clusterin; see "Protein immunoanalysis of 
platelet clusterin and neurogranin"), a low-abundant transcript 
(T-cel) receptor ^-polypeptide), and the genes encoding proteins 
with well-established quantitative determinations (ie, glycoprotein 
a nnp3 [~50 000 receptors/platelet]; protease-activaled receptor- 1 
(PARI) [-1800 receptors/platelet]).' As shown in Figure 4, these 
analyses reveal excellent concordance between SAGE and microar- 
ray studies, demonstrating the predominant frequency of the 
mitochondrial-derived 16S rRNA/NADH2 transcripts, with incre- 
mentally lower expression of other transcripts as initially demon- 
strated by microarray (1 6S > NADH2 > thymosin £4 > neurogra- 
nin > clnstcnn > tt, lrj p., > PARI > TCR(3). 

Given the small number of nonmitochondrial SAGE tags 
available for analysis (n = 233), limited conclusions can be drawn 
using traditional (nonsubtraction) platelet SAGE libraries as pre- 




' Figure 4. Oonntilative icol-timc FU-PCR analyst of platelet transcripts, fleal- 
lime RT-PCR was compii-ted by using purified platelet RNA and oligonucleotide 
primer poiis specifically dt'J.igned using Pnmer3 software (o generate similarly ;,ized 
(-?00bp) PCR P'OJucts. optima cd to Ihe same annealing temperature. In graph. 
(□) i«?pre;.ents lib. t») reprt':cn(s Ilia. (.'^) re-presents PARI, t» represents t&S 
»RNA, (T) mpi events MADH2. (▼) rep<esenls thymosin, (O) represents c.iuslerin, i*) 
pepre^enis neurogranin. and (•) represents T CP() Curves are representative o( one 
comple te set ol e> penmrnts |rppeaied r.vice). and line plots reflect averaye 

detetitlinaltof.v horn 3 v;rl'b [-.Prtomied in pnmltet with GEM less Ihrin 1% lor nil 
d;ily points. 



sented here. Overall, a total of 126 unique tags were identified, the 
majority of which (94) were represented only once. Of the total 
unique tags, nearly one half represented novel genes hot present on 
the Affymetrix U95Av2 GeneChip. Of the genes with unique tags 
identified more than once, there was excellent concordance with 
microarray expression analysis, with nearly all of the SAGE tags in 
Table 3 corresponding to platelet top 75 microarray transcripts. The 
platelet factor (PF) 4 variant represents a single aberration because 
this was rank-ordered approximately 350 by microarray, although, 
its SAGE tag frequency was identical to that of the predominant 
PF4 transcript. The lack of extensive nonmitochondrial SAGE 
sampling precludes any further extrapolations from this apparent 
aberration. Of note, a subset of these lags had long poly(A) tracts; 
although they all corresponded to genes identified in the RefSeq 
database. 71 We cannot exclude the possibility of a SAGE artifact 
for this small subset of. tags (~2%, representing 46 of 2033 tags), 
although the authenticity of the vast majority of tags (—98%) 
clearly validates Ihe methodology. These lags are most likely 
explained by the unique biology of the platelet (ie, mRNA decay in 
the absence of de novo transcription) or to mRNA degradation 
occurring during the extensive purification methods. In summary, 
even with a remarkably limited sampling, the power of this 
approach in gene identification of relatively abundant and less 
abundant transcripts is evident. It is clear, however, given the 
unique molecular anatomy of the platelet (ie. abundance of 
mitochondria) transcripts), that SAGE adaptations will be required 
for more comprehensive genetic profiling. 79 

Protein irnmunoannlysis ol platelet ctuster'tn and neurogranin 

AJihough most of the "most abundant 11 transcripts would conform 
to a priori predictions for platelet-expressed mRNAs, a number of 
transcripts were identified that had been poorly characterized in 
human plalcleis. To further establish the authenticity of highly 
expressed transcripts such as clusterin and neurogranin, confirma- 
lory protein analyses were completed. As shown in figure 5, both 
proteins were clcarlv delected in purified platelet Jvsates; further- 
more, i heir cellular platelet distributions conformed lo l hose 
predicted based on previously proposed functions. Noic for c.x- 
ample thai clusterin — functionally characterized as a complement 
lysis inhibitor able to block the terminal complement teaclion— is 
primarily expressed on (he extruc ellului platelet niembiane. jn 
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Non-permeabitbed 






Figure 5. Immuooanafysrt of pistole* neuroflranin and 
clusterm. (A-D) Get-filtered ptateteb, wera either fixed in 
3-7% torni3Weh^o (rK>npefmo3blfeod) of fixed with pormo- 
abtfizaSon in the presence of 0.1% TritonOC; followed by Row 
cyloxnetncanarysfcusbgar^^ . 
rogranin antibodies and the FnOconjurpted species- 
specific secondary antibody K C the FFT&ccfyiigated 
anfrabbft and oniirnouse controls are essentially supertrn- 
posed). (E-F) Ten micrograms of scrubifced HepG2 cete 
(hepatccyte cell Bne). human brairx w puified pbtetel rysales 
were analyzed by SOS-PAGE." i&nd imrrwnoblc* analysis 
were compteted by using 1:1000 oTMSoris of either antineuro- 
granin (18% SOS^PAGE) cf antkiusierin (8% SDS-PAGE) 
antibodies. The antidustonn anQwdy recognized 2 ptateiet 
immunoreactive species under, shorter exposue, Although 
the relative newogranin and dusterin protein abundances 
are subop&nolry quantified by rhese.arioryses. ptatelel dus- 
terin appears to o^nwnstrate consktersblo expression when 
compared with that previous*/ identified in hepatocytes. M 



Given the importance of complement activation in platelet destruc- 
tion, the prominent expression of cell-surface clusterm might 
suggest a role for this protein in normal and pathologic events 
regulating platelet survival. Interestingly, a ciusterin-deficient 
knockout mouse has been generated that demonstrates enhanced 
cardiac dysfunction in a model of autoimmune myocarditis. 3 ' 
Although these mice apparently have normal baseline hemograms 
(B. Aronow, personal communication, October 2002), it remains 
unestablished if they would be predisposed to immune-type 
thrombocytopenia in systemic models of autoimmunity. 

Similarly, the gene encoding an intracellular effector protein 
that may have key roles in downstream platelet activation events 
has now been demonstrated to have abundant transcript expression 
in human platelets Neurogranin is a highly expressed platelet 
transcript with its gene product demonstrating a primarily intracel- 
lular pattern of distribution. Neurogranin is generally described as a 
brain- specific, Ca 24 -sensitive calmodulin-binding phosphoprotein 
that is preferentially expressed in neuronal cell bodies and den- 
drites. 32 -' 0 It is a specific protein kinase C (PKC) substrate that enn 
also be modified by nitric oxide and other oxidants to form 
intramolecular disulfide bonds. Both its phosphorylation and 
oxidation state attenuate its binding affinity for calmodulin. 33 In 
stimulated platelets, PKC generation is linked to various activation 
pathways such as calcium-regulated kinases, mitogen-actrvated 
protein (MAP) kinases, and receptor tyrosine kinases.' Thus, these 
observations suggest that platelet neurogranin may function as a 
previously unidentified component of a PKC-dependcnt activation 
pathway coupled to one (or more) of these effector proteins. 



Discussion 



a mixed population of variably aged platelets, a "static'* mRNA 
profile represents an average of this heterogeneous blood pool. 
Despite these potential limitations, the combination of genomic 
and proteomic technologies are likely to provide powerful tools for 
the global analysis of platelet function. Current strategies for 
cataloging "whole cellular proteomes" are generally accomplished 
by using 2 developing methodologies:. ( 1 ) high resolution 2-dimen- 
sional polyacrylamide gel electrophoresis (2-DE) with mass spec- 
trometric sequence identification/" and (2) microcapillary liquid 
chromatography with tandem mass spectrometry (pLC-MC/ 
MC). 35 Further modifications of both procedures have been devised 
for direct comparative studies between 2 cellular proteomes. The 
introduction of 2-DE differential gel electrophoresis has now made 
it possible to detect and quantify differences between experimental 
sample pairs resolved on the same 2-dimensional gel. 36 Likewise, 
the application of isotope-coded affinity tags to uXC-MC/MC 
represent a novel means of quantitative analyses between cellular 
proteomes. 37 The success of both approaches relies on the availabil- 
ity of comprehensive genomic databases and mathematical algo- 
rithms for optimal protein identification. Indeed, mathematical 
modeling smdies have demonstrated the need to delineate both 
protein and mRNA expression levels for optimal definition of 
intracellular networks. 3 * Our data present an initial framework for 
delineating platelet function by defining the molecular anatomy of 
human platelets, information that is likely to provide important 
clues into the dynamic protein interactions regulating normal and 
pathologic platelet functions. Furthermore, because the platelet 
transciiptome mirrors the mRNAs derived from precursor 
megakaryocytes, these analyses may provide insights into the 
biochemical and molecular events Tegulaling megakaryocytopoi- 
es>s andA>r pioplatelet formation. 



These data provide documentation for a unique platelet mRNA 

profile that may provide a tool foi analyzing platelet molecular Acknowledgments 
networks. Nonetheless, the molecular analysis of the platelet 
tiaiiSL-ripioine may be confounded by the constant decay of 
mRNAs in the absence of new gene transcription, a situation that 
mav. for example. limit the identification of low- abundance 
transcripts. Similarly, because the circulating platelet poo) contains 
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Overexpression of a DEAD box protein (DDX1) in neuroblastoma and 
retinoblastoma cell lines. 

Godbout R , Packer M Bie VV. 

Department of Oncology, Cross Cancer Institute and University of Alberta, 1 1560 
University Ave., Edmonton, Alberta T6G 1 Z2, Canada. 

The DEAD box gene, DDX1, is a putative RNA helicase that is co-amplified with 
MYCN in a subset of retinoblastoma (RB) andneuroblastoma (1MB) rumors and cell lines 
Although gene amplification usually involves hundreds to thousands of kilobase pairs of 
DNA, a number of studies suggest that co-amplified genes are only overexpressed if they 
provide a selective advantage to the cells in which they are amplified. Here, we further 
characterize DDX1 by identifying its putative transcription and translation initiation sites. 
We analyze DDX1 protein levels in MYCN/DDX1 -amplified NB and RB cell lines using 
polyclonal antibodies specific to DDX1 and show that there is a good correlation with 
DDX1 gene copy number, DDX1 transcript levels, and DDX1 protein levels in all cell 
lines studied. DDX1 protein is found in both the nucleus arid cytoplasm of DDX1- 
amplified lines but is localized primarily to the nucleus of nonamplified cells. Our results 
indicate that DDX1 may be involved in either the formation or progression of a subset of 
NB and RB tumors and suggest that DDX1 normally plays a role in the metabolism of 
RNAs located in the nucleus of the cell. 



PMID: 9694872 [PubMed - indexed for MEDLINE] 
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Modulation of gap jauctiou mediated iutei cellular couinumicatiou in TM3 
Lcydig cells. 

goldcabcrgRC , Fortes FS, Oistancuo JM, Morales MM Franci CR. Vanmda WA 
Campos de Carvatlio AC : — - — : ' 

Institute of Biophysics Carlos Chagas Fillio, UFRJ, Brazil. 

L °ST ™ dulation of intercellular communication via gap junctions was investigated 
m ,y^ ^ g Under l0W and h 'S h conflu <™e states, and upon treatment of the 

u Knt t,meS W ' th activators °f Protein kinase A (PKA) and protein kinase C 
Y r y f ellS in low ^nfuence were readily coupled, as determined by transfer of the 
dye Luafer Yellow; on reaching confluence, the cells uncoupled. Western blots and RT- 
PCR revealed that connexin 43 (Cx43) was abundantly expressed in TM3 Leydig cells 

o^^ 1011 W3S decreased after the cells achieved confluence. Stimulation of PKA 
or P1CC induced a decrease in cell-cell communication. Staurosporin, an inhibitor of 
protein kinases, increased coupling and was able to prevent and reverse the uncoupling 
actions of dibutyryl cAMP and I2-0-tetradecanoyl-phorbol-l3-acetate (TP A) Under 
modulat.on by confluence, Cx43 was localized to the appositional membranes when cells 
Tr,^ ™ d W3S mainly U,e ^Pl*™ when they were uncoupled. In addition 
cAMP and TPA reduced the surface membrane labeling for Cx43, whereas staurosporin 
increased it. These data show a strong correlation between functional coupling and the 
membrane d.stribuuon of Cx43, implying that this connexin has an important role in 
intercellular communication between TM3 cells. Furthermore, increased testosterone 
secretion m response to luteinizing hormone was accompanied by a decrease in 
intercellular communication, suggesting that gap junction mediated coupling may be a 
modulator of hormone secretion in TM3 cells. 
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Expression level of Ubc9 protein in rat tissues. 



Golcl>iowski £, S Z utc A , Sakowicz M , SzutowiczA, PawelczvkT . 

Department of Molecular Medicine, Medical University of Gdansk 80-2 1 1 Gdansk 
Poland. ' ' 

Ubc9 is a homologue of the E2 ubiquitin conjugating e^e and participates in the 
cova ent Imbng of SUMO-1 molecule to the target protein. In tins report we describe a 
simple and efficient method for obtaining pure human recombinant Ubc9 protein The 
punfied Ubc9 retained its native structure and was fully active in an in vitro sumoylation 
assay with the promyelocyte leukaemia (PML) peptide as a substrate. In order to better 
understand the physiology of Ubc9 protein we examined its levels in several rat tissues 
Immunoblot analyses performed on tissue extracts revealed quantitative and qualitative 
differences m the expression pattern of Ubc9. The Ubc9 protein was present at a high 
level m spleen and lung. Moderate level of Ubc9 was detected in kidney and liver Low 
amount of Ubc9 was observed in brain, whereas the 18 kDa band of Ubc9 was barely 
visible or absent m heart and skeletal muscle. In heart and muscle extracts the Ubc9 
antibodies recognized a 38 kDa protein band. This band was not visible in extracts of 
other rat tissues. A comparison of the relative levels of Ubc9 mRNA and protein 
inchoated that the overall expression level of Ubc9 was the highest in spleen and lung In 
spleen, lung, kidney, brain, liver and heart there was a good correlation between the 1 8 
kDa protein and Ubc9 mRNA levels, in skeletal muscle the Ubc9 mRNA level was 
unproportionaliy high comparing to the level of the 1 8 kDa protein. The presented data 
indicate that in the rat the expression of the Ubc9 protein appears to have some degree of 
tissue specificity. . e^ui 
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ABSTRACT 

Motivation: Protein abundance is related to mRNA 
expression through many different cellular processes. 
Up to now, there 1 have been conflicting results on how 
correlated the levels ot these two quantities are. Given that 
expression and abundance data are significantly more 
complex and noisy than the underlying genomic sequence 
information, it is reasonable to simplify and average them 
in terms of broad proleomic categories and features (e.g. 
functions or secondary structures), for understanding their 
relationship. Furthermore, it will be essential to integrate, 
■ within a common framework, the results of many varied 
experiments by different investigators. This will allow one 
to survey the characteristics of highly expressed genes 
and proteins. 

Results: To this end, we outline a formalism tor merging 
and scaling many different gene expression and protein 
abundance data sets into a comprehensive reference 
set, and we develop an approach for analyzing this in 
tetms ol broad categories, such as composition, function, 
structure and localization. As the various experiments are 
not always done using the same set of genes, sampling 
bias becomes a central issue, and our tormalisrn is 
designed to explicitly show this and correct for it. We apply 
our formalism to the currently available gene expression 
and protein abundance data lor yeast. Overall, we found 
substantial agreement between gene expression and 
protein abundance, in terms ol the enrichment ol stiuctural 
and tunctional .cotegories. This agreement, which was 
considerably greater than the simple correlation between 
these quantities tor individual genes, reflects the way 
broad categories collect many individual measurements 
into simple, robust averages. In particular, we iound 
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that in comparison to the population of genes in the 
yeast genome, the cellular populations of transcripts and 
proteins (weighted by their respective abundances, the 
transcriptome and what we dub the translatome) were both 
enriched in: (i) the small amino acids Vat, Gly, and Ala; 
(ii) low molecular weight proteins; (iii) helices and sheets 
relative to coils; (iv) cytoplasmic proteins relative to nuclear 
ones; and (v) proteins involved in 'protein synthesis,' 'cell 
structure/ and "energy production.' 
Supplementary information: http://genecensus.org/ 
expression/translatorne 
Contact: mark.gerstein@yale.edu 

INTRODUCTION 

High throughput experimentation, measuring mRNA 
(Schena ei al, J995; Eisen and Brown, J 999; Ferea 
and Brown, J 999; bipsbuiz et at, 1999) and piotein 
expression (Anderson and Seilhamer, 1997; Fulcher et al, 
1999; Gygi et a!. r 1999a; Ross-rvlacdonald et al, J 999; 
Lopez, 2000; MacBeath and Schreiber, 2000; Nelson 
et al y 2000; Zbu et al, 2000) are currently the single 
richest source of genomic information. However, how to 
best interpret this data is stiff an open question (Bassell 
pi al, J996; Wines and Friedman, 1999; Zhang, J 999; 
Cerstejn and Jansen, 2000; Searls, 2000; Sherlock, 2000; 
Ctaverie, 1999; Einarson and Golernis, 2000: Epstein and 
Butow, 2000; Shapiro and Harris, 2000). Understanding 
how protein abundance is related to mRNA transcript 
levels is essential for interpreting gene expression, protein 
interactions, structures and functions jn a cellular sys- 
tem (Hntziroanikatis etal ; 1999). Moreover, as protein 
concentration is the more relevant 'variable with respect 
to enzyme aciiviiy, it connects genomics to the physical 
chemistry of (he cell (Kidd et ol.. 2001). Protein abun- 
dance may also be invaluable for diagnostics and for 
determining drug targets (Cor dials et a I . 2000). 
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Previously, we surveyed the population of protein 
features — such as folds, amino acid composition, and 
functions — in yeast, and other recently sequenced 
genomes (Gerstein, 1997, J998a,b; Gerstein and Hegyi, 
J 998; Hegyi and Gerstein. 1999; Das and Gerstein, 
2000; Lin and Gerstein, 2000), and we extended this 
concept to compare the -population of features in the 
yeast transcriptome to that in the genome (Drawid et 
ai, 2000; Jansen and Gerstein, 2000). Others have also 
done related work (Frishman and Mewes, 1997; Tatusov 
et ai, J 997; Jones, 1998; WaJlin and von Heijne, 1998; 
Frishman and Mewes, 1999; Wolf et ai, 1999). Here, we 
present a new methodology to compare the features of the 
. mRNA expression population with the protein abundance 
population. 

Precise terminology is essential for this comparison. 
Unfortunately, 'proteome* is used inconsistently. Pro- 
teome can logically be used to describe all the distinct 
proteins in the genome (Qi et ai, 1996; Cavalcoli et ai, 
1997; Fey et ai, 1997; Carrels et ai, J997; Gaasterland, 
1999; Jones, 1999; Sali, 1999; Tekaia et ai, 1999; 
Bairoch, 2000; Cambillau and Claverie, 2000; Doolittle, 
2000; Pandey and Mann, 2000; Rubin et ai, 2000) and, 
in this context, it is equivalent to what others may refer 
to as the coding part of the genome. However, in papers 
on two-dimensional (2D) electrophoresis, it is often used 
to describe the sum total of proteins in a cell, taking 
into account the different levels of protein abundance 
(Shevchenko et ai, 1996; Gygi et ai, 2000a; Lopez, 
2000; Washburn and Yates,.2000). In an effort to be clear, . 
we propose the term 'translatome' for this second usage 
of proteome. ( 

With this definition, we are able to refer compactly to 
three different cellular populations. These are illustrated 
in Figure I. 

(i) We use the term genome when we refer to the 
population of open rending frames, where each ORF 
counts once. 

(ii) We use the term transcriptome when we refer to 
the population of mRNA transcripts. This term was 
originally coined by Vclculescu et al. (J 997). Note 
that each ORF may give rise to different numbers 
. of transcripts. Consequently, the nauscriptome is 
essentially the same as the genome but with each 
ORF weighted by its expression level, 
(iii) The ne,\l level is the cellular population of proteins. 
As each protein represents n translated transcript, 
we make an analogy with the. term transcriptome 
and use the term translatomc as described above 
to describe this third population. Thus, the trans- 
la tome is a subset of the genome wheie each 
ORF is weighted by its associated level of protein 
abundance. 



Note that one could also less compactly call the trans- 
latome a 'weighted proteome/ However, doing so assumes 
one of the two aforementioned definitions of proteome. To 
avoid ambiguity, we studiously avoid the use of proteome 
altogether in the paper. 

Differences between the trans la tome and the transcrip- 
tome exist given that transcripts from different genes 
can give rise to different numbers of proteins, due to 
different rates of translation and protein degradation. 
Post-transcriptiona! modifications further affect the 
translalome. 

In our analysis of the transcriptome and translatome, we 
focus on global protein features rather than the compari- 
son of individual genes. Previous analyses have shown that 
differences between mRNA expression and protein abun- 
dance levels can be quite dramatic for individual genes. 
This may either be due to the noise in the data or to funda- 
mental biological processes. However, our analyses show 
that the variation between transcriptome and transJatome 
is much smaller for global properties that are computed by 
averaging over the properties of. many individual genes. 

METHODS 
Data sources used 

For our analysis we culled many divergent data sets, 
representing protein abundance and mRNA expression 
experiments and also other sources of genome annotation. 
These are all summarized in Table J. 

Biases in the data 

The databases that annotate the specific genes may 
not always be accurate (lshii et ai, 2000). Gene Chip 
experiments suffer with regard to cross hybridization 
and the saturation of probes. SAGE data degrades for 
lowly expressed mRNAs. 2D gels are unable to resolve 
membrane proteins (approximately 30% of the genome) 
and basic proteins (Gerstein, 1998c; Krogh et ai, 2001). 
In addition, the procedures for identification and quan- 
tification of the protein spots are subject to uncertainties 
(Haynes and Yates, 2000). Human biases include the 
lack of low abundance proteins (Fey and Larsen, 200J; 
Gygi et ai, 2000b; Harry et ai, 2000) and the differences 
between Jabot atones in sample preparation. Our reference 
expression data set attempts to resolve these problems. 

Da Li set scaling 

A reference set for mRNA expression. With many differ- 
ent mRNA expression data .rets available, it is worthwhile 
to integrate them into a single unified lefeieuce set : with 
the intention of leducinc the noise and errors contained in 
the individual data sets and to obtain a unified estimate of 
(he normal expression Mate in a cell. 
We adopt rui iterative scaling and merging formalism. 
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Fig. I. Schematic overview of the analysis. On the Jeft-side we outline the terms we use to describe the process of gene expression. The 
coding section of the genome is transcribed into o population of mRNA transcripts called the 'trahscriptoroe/ The transcripts in turn are 
translated to a population of proteins; we use the term 'transJalome* for this protein population rather than the alternative 'proteome* because 
the latter term may be confounded with the protein complement of the genome (which is not necessarily associated with a quantitative 
abundance level). . 

The matrix in the middle schematically shows an analysis of the three stages of expression. In general, we define a protein 'population' as 
a set of genes associated with a corresponding number of expression or abundance levels ('weights'); Jn the matrix each row represents a 
weight and each column a gene set. In particular, we differentiate between the mRNA reference expression set (GmRNA = ^Gen). which 
essentially covers the complete genome, and the reference protein abundance set (Gj> loi ) which contains the proteins in data sets 2-DE#I and 
2-DE #2 (see Table 1) because the protein abundance set is a significantly smaller subset of the genome. By definition, this subset contains 
only proteins that can be identified by 2-D gel electrophoresis and is therefore biased in this sense. The enrichment figures throughout this 
paper, through a comparison of the right- and left-sides of this figure, show the results of the experimental biases of 2D gels on the data set. 
Each pie chart represents □ composition of a particular protcin.featuie F (for instance, an amino acid composition) in a population (represented 
by the symbol f.i). We can further look at the 'enrichment' of this feature in one population relative to another (represent cd by the symbol A, 
see Section 'Methods* /or an explanation of the formalism). 



which we summarize below. We present a more detailed 
review of the meihoris on our web site. 

We stan with the values of one gene chip data set Ui 
where i is used throughout as a subscript to denote gene 
number. We then transform the values of (he next Gene 
Chipdat3 set X; to with the following non-Jinear regres- 
sion: min£ ( - (}', - Ui) 7 with T, = AX? where A and B 
are I he para meters of the regression'. Note that two Gene 
Chip sets may not be defined fot the same set of genes, 
so we have to perfotm the fit only over the genes com- 
mon lo both sets. The motivation for scaling is that the 
dynamic range ol observed expression levels varies some- 
what between different data sets, although cell types and 
growth conditions :ne vr.ry similar. Reasons lor dispar- 
ity may include diffeient calibration procedures for i clai- 
ms? fluorescence intensity ro a cellular concentration (men- 
s tried in copies of transcripts pet cell) or differ en l pro- 
tocols for harvesting and reverse- transcribing (he cellular 
mRNA. 

We then merce and average the data to create a new 



reference set V as follows: 



If Ui and Y; are both defined for gene i and 



\Yj - Uj\ 
Yj + U, 



Then V' ( = ±(Yi -f Ui) 

Else if only Yi exists, V; = Yj 

Else V; = Ui . 

As presented above, where only one data set has a value 
for the corresponding ORE, we incorporated that value 
and did not exclude it. When both. data sets have values 
for an ORE. we averaged the values if they were within 
of each other; otherwise, we just stayed with the 
original chip data set Ui- We used u = 15% In otder to 
prevent outliers from skewing the result. This \5 ( 7c value is 
a reasonable threshold for excluding outliers though other 
values i e.g. 10 or 209c) would give similar results (data 
not shown). Other data sets are subsequently included in 
the same procedure, continuing the iteration from the new 
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expression values V;. The initial iteration starts with the 
Young Expression Set, as U tt since we have the highest 
confidence in its accuracy. 

The SAGE data (Velculescu et aL t 1997) was not 
included in the above procedure syrce it is of a funda- 
mentally different nature! An advantage of the SAGE 
technology over Gene Chips is that there is no possible 
signal saturation for high expression levels, as is possible 
for chips (Futcher et al y 1999). Conversely, SAGE values 
are less reliable for lowly expressed genes since there 
is a chance that one might not sequence a SAGE tag 
corresponding to such a gene altogether. Therefore, if 
after the last iteration, the average Gene Chip expression 
level V t was both above a certain threshold p and below 
the SAGE expression level S t for the same gene, it was re- 
placed with the SAGE value; otherwise the average Gene 
Chip value was kept. This gave us our final expression 
set w mRWA . Our treatment of the SAGE data is modeled 
after that in Futcher et al (1999), and like them, we used 
/?=16. 

This incorporation of the SAGE data into the reference 
data set ensures that the highly expressed outliers are as 
accurate as possible. 

Rather than plain arithmetic averaging, this overall scal- 
ing procedure with the a cutoff avoids 'artificial averages' 
that combine very different values for a particular gene. 
Some expression values might be statistical outliers, in 
addition, it may be possible that the expression levels of 
a variety of genes can only be within mutually exclusive 
ranges or modes, such as when two alternative pathways 
are switched on or off. Simply averaging these would give 
values thai are less representative of the particular mode 
values. This situation is analogous to that in averaging 
together an ensemble of protein structures (i.e from NMK 
structuie determination). Each structure could be sieieo- 
chemicaJly correct, with all side-chain atoms in predefined 
roiamer configurations. However, an average of all struc- 
tures could yield one that is stereochemical^ inconect if 
this involved averaging over particular side- chains in dif- 
feient rotameric slates. 

With regaid to our regression analysis, we have investi- 
gated both non- linear and linear fits but found a non- linear 
procedure to be mere advantageous. The non- linear rela- 
tionship between different expression data sets perhaps 
reflects saturation in one or more of the Gene Chips — not 
an uncommon phenomenon. This non- linearity is imme- 
diately evident on scatter plots of two data sets against 
one another (see website). Accordingly, the non- linear 
fn producer a smaller residual than the linear fit: 98 297 
(non- linear) versus 122 182 (linear) for the scaling of the 
Church datrt set and 59 828 (non-linear) versus 67-162 
(linear) for the Samson data set. 



A reference set for protein abundance. We followed a 
similar procedure to calculate a reference protein abun- 
dance set from the two gel electrophoresis data sets. We 
first scaled the two data sets against the rnRNA expres- 
. sion reference data set, getting regression parameters Cj 
and Df 

where the subscript j indicates the data set 2-DE #) or 
2-DE #2 respectively; Pj j is the pTptein abundance value 
in data set j y and u; m RNA,; the corresponding reference 
expression value, and Cj and Dj are the parameters of 
the non-linear regression. 

Using these parameters, we transformed the values of set 
2-DE #2 onto 2-DE #1. Then we combined both sets into 
the reference protein set wj^ by averaging them, if both 
values existed. Otherwise, by using the existing value, viz: 

Wior.t = + £?u)/2 if both Pi j and Q i 2 exist. 
Else if only Pi j exists, wp tol i = Pj j 
Else if Qi <) exists, wp>or.r = (?/ 2- 

Enrichment of features 

Formalism. In the next part of our analysis, we want 
to group a number of proteins together into various 
categories based on common features, and characterize 
those features that are enriched in one population relative 
to another, i.e. the translatome population of proteins 
ns measured by 2D gels relative to the transcriptome 
population of transcripts or the genome population of 
genes. To this end, we set up a formalism that could 
be applied universally to all the attributes thai we were 
interested in. Due to the limitations of the experiments, 
the translatome, transcriptome, and genome populations 
are defined on different sets of genes, and sometimes we 
want to remove this 'selection bias' by forcing them to be 
compared on exactly the same set of genes. This is a key 
aspect of. our formalism as presented in Figure 1 

We call an entity like |w, O) a 'population/ where 0 
is a set describing a particular selection of genes from the 
genome and w is vector of weights associated with each 
element of this population. In particular, we focus on three 
main populations here: 

0) M ? fjCrn] is the population of genes in the genome..- 
nil 6280 genes weighted once (w — J)-. 

(ii) h v n»RNA ; mRNA ] is the observed population of the 
transcripts in the transcriptome. i.e. the 62-19 genes 
in the reference expression set weighted by their 
reference expression value: 
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(iii) {wproi* Gp Iot ) * s ! be observed cellular population of 
the proteins in the translators, i.e. the 181 genes 
in the reference abundance set weighted by their, 
reference abundance value. 

(The set of genes in the genome Gcen >s approximately 
equal to the genes in set G m RNA> such that we can use 
both symbols interchangeably.) We can also use this nota- 
. tion to describe specific experiments — e.g. [w^x, G\ ac z) 
describes the gene set and weights relating to the transpo- 
son abundance set. 

Furthermore, we define Fj as the value of a feature F 
in ORF j. For example, F could be the composition 
of leucine (a real number) or a binary value (0 or J) 
indicating whether an ORF contains a trans-membrane 
segment. Given these definitions, the weighted average of 
feature F in population [w, G) is: 



M(^lw,G.]) = 



The weighted averages of two populations |w, G] and 
[v, S] can be compared by simply looking at their relative 
difference A: 



A(F. [v, SI [w, G)) = 



M(^tv, S))-MFAv t Gj) 
HiF. h*,GJ) 



where v and w are weights for the sets of ORFs S and G 
respectively. We call A the 'enrichment' of feature F 
because it indicates whether F is enriched (if A is 
positive) or depleted (if A is negative) in population [v r 5] 
relative to [w, G). 

Usually, the gene set G is defined by the particular 
experiment, for which the weight w was measured. 
However, il is also possible to combine the gene set 
associated with one experiment with expression levels 
from onother set. One may want to do this to compute 
the enrichment only on the genes common to both 
populations, for which there are defined values for both >v 
and v, viz: A(F, \v : S O G). \w t S O G]). In practice, 
this is most relevant for comparing Gp IOI and G m RNA- 
Since Gp, m is completely a subset of G m RNA> we need 
not explicitly deal with intersections if we calculate all 
statistics directly over Gp in i 

One can adjust the weight vectors to take into account 
different types of averaging. For instance, when com- 
puting ihe amino acid composition {F — on) from the 
amino acid compositions of individual ORFs F, = ooj 
(Vy £ G). we weight by ORF length. In the case of : 
expression weights, we have: 

Wj ~ NjWmR&Aj Vj € G 

where Nj is a measuie of the length of ORF /" (such as the 
number of amino acids). 



On the other hand, when computing the average molec- 
ular weight per amino acid, we need to normalize by the 
number of amino ?icids per ORF, which is equivalent to 
choosing the following weights: 



WmRNA,j 



Nj 



V/ e G. 



Application of methodology to quantitative 
abundance sets 

Having defined our formalism, we applied it to a diverse 
set of protein features in yeast. 

Amino acid enrichment. As shown in Figure 2a, we used 
pur methodology to measure the enrichment of individual 
amino acids in both the transJatome arid the transcriptome 
relative to the genome. We found that three amino acids — 
valine, glycine and alanine — were consistently enriched in 
both transcriptome and translatpme populations. 

In Figure 2a we compare different gene Sets. In Fig- 
ure 2b we focus mainly on the variation in enrichments 
when all the comparisons are restricted to the set of 181 
genes (Gp rot n G m RNA — Gp wt ) common to all data sets. 
Thus, the differences between the populations now only 
reflect the effects of differential transcription of certain 
genes and differential translation of certain transcripts. 
We find here an enrichment specifically of cysteine in the 
translatome in relation to the transcriptome. 

To measure the statistical significance of the results on 
amino acid enrichment, we have performed a control anal- 
ysis on a randomized data set (Figure 2d). We randomly 
permutated the expression values of the ORFs 1000 times 
and then recomputed the enrichments. This allowed us to 
compute distributions for the amino acid enrichments and, 
from integrating these, one-sided ^-values indicating the 
significance of the observed enrichments. 

Amino acid enrichment in Transposon data set. We 
also tried to extend our methodology, ineffectively, to 
cope with the semi-quantitative Transposon set. We used 
. only those 450 ORFs that consistently yielded either no 
expression or high expression, as binary data, on or off. We 
show the enrichments of amino acids computed from this 
filtered Transposon abundance ser in Figure 2a. Overall, 
the enrichments from this set seemed to be attenuated in 
comparison to other data. 

Bionwss enrichment, A corollary to amino acid enrich- 
ments is the de terminal ion of the average biomass of the 
transcriptome and translatome populations (shown in Fig- 
ure 2c). We found that the average molecular weight of 
a piotcin in both populations was. on average, lower than 
in the genome population. These preliminary observations 
suggest a cell pie fere nee to use less energetically expen- 
sive prof ems for those that ait* highly transcribed or trans- 
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Fig. 2. Amino acid and biomass enrichment, (a) Shows Ihe amino acid enrichments between different populations as indicated by the legend 
to the right of the plot (the legend is ordexed in the same way as the schematic illustration in Figure I). The bais indicate the enrichment of the 
tianscriptome relative to the genome, whereas the circles indicate the enrichment of the translatome relative to the genome. In addition, we 
also show Ihe enrichment for protein abundance from the Transposon abundance set, represented by the circles with the line through ihem. 
(b) Shows a different view of amino acid enrichment from that contained in (a), now focusing on changes, and thus restricting ihe comparison 
to the genes common to all the data sets. The graph is ordered according to the enrichment from transcriptome to tianslatome (black squares). 
We focus here only on the changes for the abundance gene set (Gp, ot ) to exclude the effects that arise from looking al different subsets. In this 

. view the enrichments from genome to transcriptome (while squares) 3nd from genome to tianslatome (white diamonds) look more similar 
than do the analogous sets in (a). To make comparison with (a) easier we again show the enrichment from genome to the transcriptome for 
the complete gene set (Gcen. shown in bars), (c) Shows biomass enrichment. The le/t panel depicts the average molecular weight per ORF 
(in units of kDa) and the right panel, ihe average molecular weight pei amino acid (in units of Dalions) in each of the thjee stages of gene 
expression. The numbers inside the circles indicate the average molecular weights. The values next to the arrows indicate the enrichments 
in biomass between different populations. Both the ciicle diameters and the mow widths are functions of ihe corresponding values (the 
hollow arrow indicates a positive value). It is very clear thai the average molecular weight per ORF is much lower in the translatome (by 20 
or 15%) and transcriptome (by 2?%) ihan in the genome. This relative depletion of biomass mainly takes place as a result of transcription; the 
effect of translation is less clear, depending on the populations compared. On ihe other hand, the depletion in the average molecular weight 
per amino acid (-3.3% from genome to translatome) is an order of magnitude smaller than in the average weight per ORF. This shows 
that the yeast cell favors the expression of shorter ORFs over longer ones, and agiecs with our earlier observation that there is a negative 
correlation between maximum ORF length and mRNA e.xpiession (Jansen and Gcrsicin, 2000); it seems ihat this effect mainly takes place 
during transcription rather than tianslation. (d)This plot shows that the amino acid enrichments are statistically significant. We have assessed 
significance by landomly permuting ihe expression levels among the genes and then iccompuiing the amino acid enrichments. This procedure 
can be icpeated and used lo generate distributions of random enrichments thai ran then be compared against the observed enrichments. In 
the ploi the gray bars represent the observed enrichments already shown in Figure 3a. On lop of the gray bars we show standard boxplots 
of enrichment distributions based on 1000 random permutations. (The middle line repiesents the distribution median. The upper and lower 

.sides of the box coincide with the upper and lower quariiies. Outliers are shown as dots and defined as data points that are outside the tange 
of ihe whrskets. the length of which is 1.5 ihe interquartile distance.) Based on the random distributions, we can compute one- sided p- values 
for the observed emu hmrnts. Amino acids for which ihe /?- values arc less ihan 10" ' arc shown in bold font.. 



latcd. However, wc also found that the average molecu- 
lar we. it: hi pet nmino odd differed much less between the 
tianse ripiome and ihe iranslniome on ihe one band, and the 



genome on ihe other hand (though it was still slightly Jess). 
This finding indicates thai lower molecular weights in the 
iianslaiome and transcripiome relative to ihe genome aie 
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predominantly due to greater expression of shorter pro- 
teins rather than the incorporation of smaller amino acids. 

Secondary structure composition. We also used our 
methodology to study the enrichment of secondary- 
structural features. Secondary structural annotation was 
derived from structure prediction applied uniformly to all 
the ORFs in the yeast genome as described in Table J. 
As shown in Figure 3a, all three populations — genome, 
transcriptome, and translatome— had a fairly similar 
composition of secondary structures — sheets, helices, and 
coils. The differences between populations were marginal 
and based only on the small subset of genes. 

We also found that Transmembrane (TM) proteins 
were significantly depleted in the transcriptome (see 
website and caption). These results nie consistent with 
our previous analyses (Jansen and Gcrsiein. 2000). The 
protein abundance dala does not have any membrane 
proteins. 

Subcellular localization. Figure 3c shows the enrich- 
ment of proteins associated with the various subcellular ' 
compartments. For clarity, we divided the cell into five 
distinct subcellular compartments, (see Table 1). We 
found that, in comparison to the genome,, both the tran- 
script ome and trans/a tome are enriched in cytoplasmic 
proteins. This is true whether wc innke our comparisons in 



relation to the relatively large reference mRNA expression 
set or the smaller reference protein abundance set. As 
Figure 3c shows, the 2D geJ experiments are clearly 
biased towards proteins from the cytoplasm. However, in 
the biased subset 6*p IO t transcription and translation lead 
to an even higher fraction of cytoplasmic proteins in the 
translatome. 

Functional categories. Finally, we compared the enrich- 
ment of various functional categories in both the trans- 
latome and the transcriptome (see Figure 3b). This gives 
us a broad yet informative view of the ceil as a whole. As 
described in Table I, we used the top-level of the MIPS 
scheme for the functional category definitions. We found 
broad differences between the various populations, with 
some of the functional categories showing strikingly high 
enrichments. 

DISCUSSION AND CONCLUSION 

We developed: (i) a methodology for integrating many dif- 
ferent types of gene expression and ptotein abundance into 
a common ft a me work and applied this to a preliminary 
analysis; (ii) a procedure for scaling :md merging different 
mRNA and protein sets together; and (iii) an approach for 
computing the enrichment of various proteomie features in 
the population of transcripts and proteins. We showed that 
by analyzing broad categories instead of individual noisy 
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Function 



□ Metabolism (1) 
0 Energy (2) 

j-jCet) tpowlh. cell division 

and DNA synthesis (3) 
□Transcription (4) 
D Protein synthesis (5) 
FjCeWutax organization (30) 

Unclassified (98, 99) 
DOthei 



Fig. 3. Breakdown of the transcriptome and translatome in terms of broad categories relating to structure, localization, and function. AJ1 
of the subfigures are analogous to the schematic illustration in Frguie J. (a) Represents the composition of secondary structure in the 
different populations, (b) Represents the distribution of subcellular localizations associated with proteins in the various populations. We used 
standardized localizations deve loped earlier (Draw id and Ger stein, 2000), which, in tin n t were derived from the MIPS, YPD, and SwissProl 
databases (Bairoch and Apweilei, 2000; Costanzo el of. r 2000; Mewes el oK t 2000). The subcellular localization has been experimentally 
determined for less than half of the yeast proteins, so our analysis applies only to this subset, (c) Shows the division of ORFs into different 
functional categories (according to the MIPS classification) in the various populations. Only the laigest functional categories of the top level 
of the MIPS classification aje shown. The group 'other' contains the smaller lop-level categories lumped together. This 'other' group is 
different fiom the group 'unclassified,' which contains genes without any functional description. 



data points, we could find logical (rends in the underlying 
data. For example, individual tianscription factors might' 
have higher or lower piotein abundance than one expects 
from their mRNA express ion. but the category "transcrip- 
tion factors' as a whole has a similar representation in the 
transcriptome. ami tianslatome. 

We found, as pievioitsly described {Futcher et al. 7 J 999; 
Gygi ei nl. 1 999b; Grcenbanm ei af. : ?00l) f a weak 
corr elation between individual measurements of mRNA 
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and protein abundance. The outliers of this correlation 
tend to be associated with ceJhdar organization. One 
might conceive ol using these outliers (i.e. those with 
significantly iliffeient transcriptional and translation^ 
behavior) to rind consensus regulatory sequences. One 
possible method would involve using predicted mRNA 
structures Oacgcj a of., 1990; Zuker, 2000) to find and 
investigate consensus structural elements in these outliers 
to which the yeast iranslational machinery is known to be 
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Table I. Data sets 



Description 



Size iORFs) 



Reference 



mRNA expression 
Young 

Chuich 

Samson 

SAGE 

Reference expression 

Protein abundance 
2-DE#l 

2 DE «2 
Transposon * 

Reference abundance 

Annotation 

Annotated localization 

TM segments- 

MIPS /unctions 

COR secondary structure 



Gene chip profiles yeast cells with 
mutations that affect transcriptfoo 
Gene chip profiles of yeast cells 
under four different conditions 
Comparing gene chip profiles for 
yeast cells subjected to alkylating agent 
Yeast cells during vegetative growth 

Scaling and integrating the mRNA 
expression set into one data source 



Measurement of yeast protein 

abundance by 2D gel 

electrophoresis and mass spectrometry 

Similar 16 2-DE set#l 

Large-scale fusions of yeast genes 
with locZ by transposon insertion 
Scaling and integrating the 2-DE 
data stts into one data sou.ee 



Subcellular localizations of yeast 
proteins 

Predicted TM and 
soluble proteins in yeast 

Functional categories for yeast 
ORFs 

Predicted secondary structure 
yeast ORFs 



5455 

6263 

6090 

3778 
6249 



71 

1410 

181 



2133 
(6280) 

2710 
(6280) 

3519 
(6194) 

6280 



Holstegerfo/. (1998) 
Roth etaL (1998) 
Jeltnslty and Samson (J 999) 
Velctilescurta/. (1997) 

Gygi ero/.(!999a,b) 

Futcher etal (1999) 
Ross-Macdonald et ol (1999) 



Drawid and Gerstein (2000) 
Geistein0998a,b,c) 
Mewes el ol (2000) 
Gerstern(1998a,b.c) 



This ^le provides an overview of the daia sets used "in qui analysis. The table is divided into three sections, fhe .op section lists d.ffcrenl mRNA expression 
sc., The middle section shows the p,otein abundance data sets used. T*e bottom section contains different annotates of protein feMu.es. Fhe column ^Daia 
stf lists a shonhand reference to each data set used throughout this paper. The neM columns contain a brief description of the data sets, the number of ORFs 
contained in each of them, and the Ii.era.ure reference. In control to the other data we investigated, the refe.ence egression and abundance data sets have 
been calculated foi the purpose of our analysis (see text). An expanded version ol the table is available on our web site. 
So«ne funhcr information on the genome annotations: 

Localkaihm. Protein localization information from YPD, MIPS and SwissProt were merged, filtered and standardized (Bairoch and Apweifei. 2000, 
Covtanjo « .if ?000; Mewes el ol, 2000) into five simplified companmenis-cytoplasm, nucleus, membrane, unwellulai (including p.ote.ns in ER and 
polgi). ami mitochondrial-according to the protocol in Drawid al (7000). This yielded a standardized anno.a.ion of pioiein subcellular localization lo, 
7 133 out of 6280 ORFs/ 

7Af in mmt> In 2710 out of 6280 yeast ORFs TM segments arc predicted to occu.. ranging from low to high confidence (73? ORFs). The TM prcd.c.ion was 
Wit oimrrf as follows: the values f.o.n the scale fo. amino acids in a window of size 20 (the typical size of a TM hclu.) wee averaged and then compared 
against a cutoff of - I kcal moT A value under .his cutoff was taken >o indicate .he existence of a TM hrli.v Initial hydrophobic fetches corresponding to 
vignal sequences for membrane insertion were excluded. (These have tire pat.em of a charged residue within the firs, seven, followed by a strrtch of 14 w.th 
an avm« rmbopbobrcity under the cutoff.) These parameters have been used, .cited, and refined on surveys of membrane juo.cm m genomes. Sure 
membrane porrins had a, leas, two TM-scgmen.s with an ave.^c hydio P hubiciiy less than - 2 kcal mol" (Ros. etal.. 1995: Gers.em w al. ,000. Santon, 
ft ni. . 2000; Srnrs (t al. ?000). 

Vumiiam. MIPS fu.rctk.nal categories have been assigned to 3519 <m. ol 6194 ORFs. (The remainder are assigned to category 98' or '99; which 
ccpu-v ponds io unclassified function ) 



tensiiivc (McCarthy, 1998). . 

Jn jetoiron to ftmciional categories, vve found lime 
i.emLs that wcte pailinibtly notable, (i) il>e 'cellular 



Organization/ 'protein synthesis/ and energy pioduciioiV 
categories were increasingly enriched as wc moved Iron 
ocnome to transcripiome to lronstoioine. In the nnnscrip- 
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tome and translatome population relative to the genome; 
(ii) proteins with 'unclassified function' are significantly 
depleted, perhaps reflecting a bias against studying them; 
(jii) proteins in the 'transcription* and 'cell growth, cell 
division, and DNA synthesis' categories were consistently 
depleted. This reflects the fact that many of these proteins, 
such as transcription factors, act as 'switches' such that 
only small quantities of the protein are necessary to 
activate or deactivate a process. These results concur with 
previous calculations (Jansen and Gerstein, 2000) wherein 
we found the transcriptome is enriched specifically with 
proteins involved in protein synthesis and energy. 

Limitations given the small size of the protein 
abundance data 

Even with the extended coverage made possible by 
merging many data sets together into reference sets, the 
analysis is still limited by the minimal data. This was 
most applicable to the protein abundance measurements, 
potentially biasing our statistical results towards certain 
protein families. Moreover, the 181 proteins in Gp,ot do 
not represent a random sample. They are skewed towards 
highly expressed, well-studied proteins. Our methodology 
attempts to control for this gene- selection bias through 
our enrichment formalism, which allows one to rather 
precisely gauge various aspects of the bias. Conversely, 
many protein features in both the translatome and the 
transcriptome are dominated by highly expressed proteins. 
Under these circumstances, it is often sufficient to look at 
this smaller number of dominating proteins to characterize 
the whole population. This is similar to the development of 
the codon adaptation index for yeast (Sharp and Li, 1987). 
While based on only 24 highly expressed proteins, it has 
proven to be robust in predicting expression levels for the 
entiie genome. 

We believe that the essential formalism and approach 
that we develop will remain quite relevant for fuune data 
sets (Smith, 2000). 
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Protein abundancy and inRNA levels of the adipocytc-typc fatty acid 
binding protein correlate in non-invasive and invasive bladder transitional 
cell carcinomas. 

Gromova I . Gromov P , Wolf CcUs JE. 

Department of Medical Biochemistry and Danish Centre for Human Genome Research,' 
The University of Aarhus, Aarhus C, Denmark. 

The adipocyte type fatty acid-binding protein (A-FABP) is a small molecular weight fatty 
acid-binding protein whose expression correlates both with the grade of atypia and the 
stage of bladder transitional cell carcinomas (TCCs). To determine if the protein 
abundancy correlates with the mRN A levels in non-invasive arid invasive lesions, we 
have analysed fresh TCCs (grade H, Ta; grade III, T2-4) by two-dimensional 
polyacrylamide gel electrophoresis (2D-PAGE) and measured the mRNA levels using the 
reverse transcription linked polymerase chain reaction (RT-PCR). Overall, the results 
showed a good correlation between protein abundancy and mRNA levels, indicating that 
the lack of expression of the protein observed in some lesions reflects low levels of 
transcription of the A-FABP gene rather than translational regulation. In addition, our 
studies showed that the loss of A-FABP protein observed in some tumors is not 
compensated by an increase in the skin fatty acid-binding protein PA-FABP, as is the 
case in the A-FABP knockout mice, 
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Abstract The adipocyte type fatty acid-binding protein (A- 
FABP) is a small moleqular weight fatty acid-binding protein 
whose expression correlates both with the grade of atypia and 
the stage of bladder transitional cell carcinomas (TCCs). To 
determine if the protein abundancy correlates with the 
mRNA levels in non-invasive and invasive lesions, we have 
analysed fresh TCCs (grade II, Ta; grade HI, T 2 J by two- 
dimensional poly aery lamide gel electrophoresis (2D-PAGE) 
and measured the mRNA levels using the reverse transcription 
linked polymerase chain reaction (RT-PCR). Overall, the 
results showed a good correlation between protein abundancy 
and mRNA levels, indicating that the lack of expression of 
the protein observed in some, lesions reflects low levels of 
transcription of ihe A-FABP gene rather than translalional 
regulation. In addition, our studies showed thai the loss of A- 
FABP protein observed in some tumors is not compensated 
by an increase in the skin fatly acid-binding protein PA- 
FAB P, as is Ihe case in the A-FABP knockout mice. 

Introduction 

Bladder cancer accounts for 4.7% of all human cancers 
diagnosed. The spectrum of bladder tumors is broad with 
various histological types that include transitional cell 
carcinomas (TCCs), squamous cell carcinomas (SCCs), and 
adenocarcinomas (1-3). TCCs are by far the most prevalent 
tumors as they represent nearly 95% of all bladder cancers in 
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the Western Hemisphere. At First presentation, about 70% of 
the urinary bladder TCCs are diagnosed as differentiated 
superficial lesions that are confined either to the mucosa 
(Ta), or to the underlying connective tissue (T,). The rest 
correspond to highly invasive, poorly differentiated tumors. 

Non-invasive TCCs occur as two distinct growth patterns, 
papillary and non-papillary (flat) lesions (1,2); that display 
significant differences in their malignant' potential and that 
are believed to originate from different genetic alterations 
(4-6). Papillary carcinomas usually correspond to low-grade 
lesions which frequently recur multiple times. These tumors 
begin as areas of hyperplasia thai later undergo a process of 
dedifferentiate (grades 1-IV). Invasive tumors may arise 
from these lesions, but often are derived from non-papillary 
carcinoma in situ that usually is of high grade at presentation 
and tend io invade and progress to muscle invasion and 
metastatic disease. 

To date, many attempts have been made to pinpoint genetic 
changes that underly progression of bladder cancer as well as 
to identify molecular markers that correlate with tumor 
progression. Cytogenetic studies and molecular genetic data 
have shown that chromosomes 3p, 4p, 4q, 5q. 3p, 9p, 9q, 
Hp. 13q, 14q, 17p and I8q are frequently altered in bladder 
urothelial tumors (4,5 and refs. therein), and as a whole they 
have supported the notion that bladder cancer is a multistep 
process. Recently, Spruck e t ot (6) showed that c]:uomosome 
9 alterations occur early during development, while p53 
mutations appear later in the process and confer invasive 
properties. The situation however is reverse in the case of 
Cis, as a large fraction of these lesions contain p53 mutations 
(5,6,8,9). Besides pointing towards two divergent pathways of 
bladder tumor progression, these studies implied that the order 
in which the genetic changes occui is important in detenrurung 
the outcome of the lesion. 

Assessment of bladder cancer is based on a thorough 
pathological examination of biopsy material which establishes 
the histological type of the tumor, its degree of differentiation 
(srade), and depth of invasion of the bladder wall (staging) 
( 11 0 12). In spile of strict criteria for (he pathological assessment 
of these lesions, there exist a significant inter- pathologist 
variation, a fact that emphasises the need for objective 
markers lhat may aid iheir classification. With ihis in mind, 
we are exploring the possibility of usin» protcome (13) 
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expression profiles of these lesions as fingerprints to define, 
iheir grade of alypia and eventually their stage (3J4). So far, 
more than 400 tumors of various grades and stages have been 
analysed by two-dimensional polyacrylamide gel 
electrophoresis (2D PAGE),, and preliminary experiments 
have shown that even though the overall protein expression 
profiles of tumors of the same grade and stage are very similar, 
there are important differences suggesting that morpho- 
logically 'identical' TCCs may be further subdivided (I). Of 
the biomarkers of TCC progression identified so far, the 
adipocyte-type fatty acid binding protein (A-FABP) is 
perhaps one of the most interesting as the levels of this 
polypeptide have been shown to correlate both with the grade 
of alypia as well as with the stage of the disease (3). Given 
the putative importance of A-FABP as a progression marker, 
and since Anderson and Seilheimer (15) recently showed that 
post-transcriptional regulation of gene expression is a frequent 
phenomena in higher organisms, we have compared the levels 
of A-FABP mRNA and protein in non- invasive and invasive 
bladder TCCs expressing and lacking this protein. 

Materials and methods 

Tumors. Fresh bladder tumors were obtained immediately 
after transurethral resection.. The grade and clinical stage of 
Ihe tumors were determined by the pathologist at the Aarhus 
Municipal hospital. Clean tumors devoid of blood clots were 
divided into small pieces for 2D PAGE and DNA, and RNA 
preparation. The latter were immediately frozen in liquid 
nitrogen and store at -80"C unlil use. 

I J5 S]- methionine labeling and 2D- PAGE. In a few cases, 
small tumor pieces were labeled with [ 15 S}-melhionine as 
previously described (3). 2D-PAGE was performed 
according to published procedures (16; see also htlp: 
//biobase.dk/cgi-bin/celis). 

HT-PCR. Frozen tumor samples were ground to powder in 
liquid nitrogen and total RNA was isolated using the acid 
guanidium isothiocyanate/phenol chloroform extraction 
procedure (17). The samples were treated with RNase-free 
DNases I (Pharmacia) to eliminate contaminating genomic 
DNA using the protocols recommended by the supplier. 
PoIy(A)* RNA was prepared using Poly (A) Quick columns 
according to the manufacturer s instructions (Stralagene). The 
synthesis of cDNA for RT-PCR reactions were carried out 
using the Gibco BRL Superscript Kit. Two pg of total RNAs 
was mixed with oligo -dTll primer, PCR buffer, MgCU 
(25 mM), 0.1 M DDT and 10 mM dNTP. The mixiuie was 
incubated at 42"C for 5 min followed by the addition of 
Superscript 31 reverse transcriptase and lunhcr incubation at 
42'C for 50 min. The reaction was terminated by raising the 
temperature to 70*C for 15 min, followed by additional 
incubation at 37'C for 20 min in the presence of RNasc H to 
deplete the RNA. Primers for known and cloned genes were 
purchase from Pharmacia as follows; A-FABP, Upper (from 
186-20.8 bp) 5 ' - G A TC ATC A GT G TG A A T G G G G A T- 3 7 
lower (from 374-397 bp) 5' CATCCTCTCGTiTTCTCrTT 
ATG-3'; B-actm upper 5'-GAGG1TGGCTCTGACTGTACC 
AC-3/lower 5'-CTCATTCAGCTCTCGGAACATCTCC-3*. 



Tabic 1. Expression of A-FABP in non- invasive and invasive, 
bladder TCCs: 
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■The levels of a A-FABP protein were determined based on the 
visual analysis of Copmassie Brilliant Blue stained gels and 
represent the average estimate of at least two different runs. Tumors 
scored as positive differed significantly with respect to the levels of 
the protein, and therefore are indicated with either four (very high), 
three (high), two (medium) and one (low) cross (see also Fig. I); 
b The mRNA levels were determined based on the intensity of 
Ethidium Bromide stained cDNA bands separated on agarose gel 
using the Bio-Rad Gel Doc 1000 system and represent the average 
estimate of al least three independent experiments. Corresponding 
mRNA levels are represented by crosses as described above. 



PCR reactions were performed in a Progene thermal cycler 
using the Advantage Klen Tag Polimerase (Clonlech). The 
cycling parameters consisted of 30 sec of denatmaiion al 
94*C, with annealing of 30 sec at 60X for B-actin or at 
64*C for A-FABP. The extension lime was for 2 min at 68"C 
for 29-40 cycles with the final extension of 7 min al 68"C- The 
PCR products were separated on K5% agarose gel 
electrophoresis followed by ethidium bromide staining and 
photography under UV light. 

Results 

A-FABP protein levels in non-invasive and invasive TCCs. 
One hundred suspected TCCs removed at the Department of 
Urology, Skejby Hospital, were analysed by high resolution 
2D PAGE and Coomassie Brilliant Blue staining. Of these. 
10 grade Jl. Ta TCCs (Table 1) were chosen to correlate A- 
FABP protein and mRNA levels as these lesions yielded 
acceptable protein profiles both in terms of their purity as 
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Figure I IEF 2D gels of whole cellular extracts from non-invasive and 
invasive TCCs. A, TCC 532-1 ; B. TCC-692-l; CTCC 763- I; D. TCC 709-1 
and a TCC 711- 1. Only (he relevant area of ibe gels are shown. 
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assessed by monitoring for ihe absence of vimentin 
(contamination with connective tissue) and desmin 
(contamination with smooth muscle cells), as well as 
polypeptide resolution. In addition, reasonable amounts of 
these tumors were available for mRNA preparation. 

Table 1 shows the levels of A-FABP protein expression in 
the 10 tumors analysed by 2D PAGE The data were scored 
entirely based on the visual analysis of Coomassie Brilliant 
Blue stained gels and represent an average estimate of at 
least two different runs. Tumors scored as positive differed 
significantly with respect to the levels of this protein, and 
therefore are indicated with either four (very high), three 
(high), two (medium) and one cross (low). Representative 
examples of Coomassie stained 2D gels of tumors exhibiting 
very high (TCC 532-1, Fig. I A), high (TCC 692-1; Fig. IB), 
medium (TCC 763-1, Fig. 1C) and undetectable levels (TCC 
709-1 and TCC 711-1 Fig. ID E) of A-FABP are shown in 
Fig. 1 (only ihe relevant area of the gels are shown). 

A-FABP mRNA levels in non- invasive grade !), Ta TCCs. 
Since in many instances only a limited amount of fresh lumor 
was available, we used RT-PCR to determine the levels of A- 
FABP mRNA in the ten TCCs analysed by 2D PAGE (Fig. I). 
Following amplification, the PCR products were analysed by 
conventional \ 5 c 7c agaiose gel electrophoresis and visualised 
by ethidium bromide staining as shown in Fig. 2. The amount 



Figure 2 RT-PCR analysis of A-FABP mRNA expression in non-invasive 
(Crll. Ta) and invasive TCCs (Gr III. T r TJ. For RT-PCR analysis, (he ss c- 
DNA was synthesized by Reverse Transcriptase using lotal RNA. and used 
for RT-PCR ampJificalion. The PCR products were resolved on 1:5* 
agaiose gels and visualised under UV light following ethidium bromide 
staining. The A-FABP panels show the results of amplifications where the 
pair of gene specific primers was used lo generate Ihe 220 bp DNA 
fragment. Amplication of A-FABP was obtained after 30 cycles of PCR. 
The B-aclin panels represent the amplification of the ft-actin gene, which 
was used as an internal control lo confirm thai equal amounts of c-DNA were 
used in each reaction. 



of cDNA in each lane was normalised using several house- 
keeping genes so as to achieve a more accurate assessment of 
the expression of the A-FABP mRNA. As shown in Fig. 2, 
TCC 532- 1 exhibited the highest amount of A-FABP mRNA, 
followed by TCCs 692-1, 763-1, 616-1, 581-1, 154-1, 166-5 
and 533-1. Undetectable levels of A-FABP mRNA were 
observed in the case of TCCs 607-1 and 709-1 (Fig- 2). 
Relative mRNA levels for the ten TCCs are given in Table I. 

A-FABP protein and mRNA levels in invasive grade III T„ 
TCCs. Of the invasive TCCs (grade 111, T 2 J ) analysed by 2D 
PAGE only six yielded reasonable protein profiles for further 
study. As shown in Table I. none of these lesions expressed 
delectable levels of A-FABP as determined by Coomassie 
Brilliant Blue signing (Fig. IE, TCC 71 l-D-ln line with these 
results, the RT-PCR analysis of these tumors also revealed 
undetectable level of A-FABP mRNA (Fig. 2. Gr M T 2 -T 4 ; 
Table J). 

Loss of A-FABP protein is not compensated by cm increase in 
PA-FABP. Recent studies of A-FABP knockout mice have 
shown that the loss of A T FABP in fat tissue is 
by an increase in the skin fatly acid- binding protein mall (W- 
Our studies, however, indicated that the human homologue 
of mall. PA-FABP (19), did not compensate lor the loss oi 
A-FABP eiiher in ihe noninvasive oi ihe invasive tumors 
analysed in this study (Fig. ID and E). In addition. Fig. 3 
shows 2D gels of |»SJ- methionine labeled proteins from two 
.rade 11. Ta TCCs (192-4. T 4 ; Fig. 3A and 192-4. T ; . Fig. 3B). 
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Figure 3. Levels of A-FABP and PA-FABP protein in grade II, Ta tumors 
resected from the same patient. The two upper panels show the 2D gel 
autoradiograms of pS]- methionine labeled proteins from TCCs (grade II. 
Ta) resected from the same patient. A. TCC 192-9 tumor 4 and B,- TCC 
192-9 tumor I. Only the relevant area of the autoradiograms are shown. The 
low panel shows the RT-PCR analysis of A-FABP mRNA expression in the 
same lumors (see also legend to fig. 2). 



which differ significantly in their levels of A-FABP protein 
and mRNA (Fig. 3. low panel). As shown in Fig. 3, the 
decrease in A-FABP observed in TCC 129-4, T, is not 
accompanied by an increase in the PA-FABP protein (Fig. IE). 

Discussion 

Of the TCC progression markers identified to dale, A-FABP 
is perhaps one of the mosi interesting as its presence correlates 
both with the grade of aiypia (p=0.0006) and the stage of the 
disease (p=0.0269) (3). A-FABP is a low molecular weight 
protein belonging to a cytosolic multigene family of lipid- 
binding proteins that include heart, liver, intestinal, muscle, 
brain, skin and epithelial isoforms (20). Members of the 
FABP family ore highly expressed in differentiated cells and 
show narrow tissue distribution. Their precise function is at 
present unknown, although there is evidence suggesting that 
they may play roles in intracellular lipid transport and 
metabolism, signal transduction (21,22) as- well as growth 
control and differentiation (23). The role in signal transduction 
has been inferred from the fact thai long-chain fatty acids and 
their metabolites can act as primary and second messengers 
in specific signalling pathways (24). Recently, it has been 
shown that A-FABP may play a central rote in the pathway 
that links obesity with insulin resistance, most likely by 
connecting the fatty acid metabolism wiih (he expression of 
TNF-a (18). Furthermore, there is evidence indicating that 
the A-FABP °enc contains sequence information necessary 
for differential ion- dependent expression in adipocytes (25). 
Our own data derived fiom the study of TCCs and normal 
uiothelium suggest that A-FABP may be required for normal 
uroihelium differentiation (I), as may be the case for PA- 
FABP in the skin (19). 



Considering the potential prognostic . value of A-FABP 
protein and/or mRNA in TCC progression it waVimpbrtam lo 
determine if ihe levels of both type of macroijiolecules 
correlated both in the non-invasive and the invasive lesions 
expressing and lacking A-FABP. The need for such correlation 
was underlined by recent studies of Anderson and Seilhamer 
(15), who reported a lack of overall correlation between the 
mRNA and protein levels of. 45 rat proteins analysed by 2D 
PAGE in combination with cDNA arrays^ Tftir data yielded a 
correlation coefficient of 0.45 which is half way between weak 
and perfect correlation. Clearly, our data showed a very' good 
correlation between Ihe protein and mRNA levels of A-FABP 
in all tumors analysed indicating that the loss of A-FABP 
protein observed in some tumors is not due to post- 
transcriptiona) regulation. 

Recently, knockout mice carrying a null mutation in the 
aP2 gene encoding for A-FABP was produced (18). These 
animals do not show an obvious morphological or metabolic 
phenotype, but exhibit a 20-fold increase in the levels of the 
keraiiriocyte type FABP (mo/7), which may compensate for 
the loss of the deleted gene (18). The human homologue of 
the mall gene, PA-FABP, was cloned in our laboratory and 
has been shown to be highly upregulated in psoriatic skin as 
well as in abnormally differentiated primary keraiinocytes 
(19). PA-FABP is expressed in normal urothelium together 
with A-FABP (3), and ongoing studies in the laboratory have 
shown that its level decreases significantly as tumors progress. 
Interestingly, the studies reported in this article did hot revealed 
a compensatory up- or down-regulation of PA-FABP in the 
TCCs analysed, supporting the contention that PA-FABP may 
also play a role in cell growth and differentiation (19). 
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wnicn suggests that MIF plays a pivotal role in the pathogenesis of ARDS. 
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HabtiY, Yano I , Okuda M . Fukatsu A . ImiijC 

Department of Pharmacy, Kyoto University Hospital, Faculty of Medicine, Kyoto 
University, Sakyo-lcu, Kyoto 606-8507, Japan. 

We previously reported that in hyperuricemia rats, renal impairment occurred and organic 
""transport activity decreased, accompanied with a specific decrease in the expression - 
^™ r fT aiUOn UaaSp0lters - r0AT1 a*" 1 r OAT3, and organic cation transporter 
ItfSL H PfeSent y ' WC inVesti 8 ated the reversibility of the organic ion transport 
activity and expression of organic ion transporters (s!c22a) during recover from 
hyperuncem.a. Hyperuricemia was induced by the administration of a chow containing 
unc acid and oxomc acid, an inhibitor of uric acid metabolism. Four days after 
discontinuance of the chow, the plasma uric acid concentration returned to the normal 
uu "T* fimct,ons suc h as creatinine clearance and BUN levels were restored 
• although the recovery of tubulointerstitial injury was varied in sites of die kidney ' 
Basolateral uptake of p-aminohippurate (PAH) and tetraethylammonium (TEA) and both 
protein and mRNA levels of rOATl , rOAT3 and rOCT2 in the kidnev prarimllv 
2Zt hT gl4d f ^f^^fro'" hyperuricemia. Basolateral PAH transport 
(rtM™- u COtr ? at,< ? n with the P rotein level of rOATl (r(2)=0.80) than rOAT3 

vro^lm^t7r^T eral TEA taaspoct showed a 5trong corre,ation With *> CT2 

ZTJ^o f ^ P f °,f r ° ne concentratio «. which is a dominant factor in 
W 2T T f f ' WaS graduaUy restored duri "g *e recovery from hyperuricemia 
bu me correlation between the plasma testosterone level and rOCTZ protein expression ' 
Zn^Z Y l™™^^ T »e S e results suggest that the regulation of organic ion 

2££ A i /' ^ "* t ° CT2 ' ty h ^ eru — * -Ssible, and orga! 
!on transport activity restores according to the expression levels of these transporters. 
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Regulation of cytochrome P4501AI in teleosts: sustained induction of 

CYT1A1 mRNA, protein, and catalytic activity by 2,3,7,8- 

teti achlorodibenzofuran in the marine fish Stenotoruus chrysops. 

Halm ME , Stegeman JJ . 

Biology Department, Woods Hole Oceatiographic Institution, Massachusetts 02543. 

Cytochrome P4501 A I (CYPlAl) is known to play important roles in the activation and 
detoxification of carcinogens and other toxicants in vertebrate animals, including fish. 
Although extensively studied in mammalian systems, the regulation of CYP1 A forms in 
other vertebrates is less well understood. We examined the time course and dose- 
response relationships for induction of C YP I Al mRNA, protein, and catalytic activity by 
2,3,7,8-tetrachlorodibenzofirran (TCDF) in die marine fish Stenotomus cluysops (scup). 
The time course of CYP1 Al induction was determined following a single ip dose (10 
nmol/kg) of 2,3,7,8-TCDF. Hepatic ethoxyresorufin O-deethylase activity was increased 
after I day, reached a maximum by 8 days, and was still elevated 14 days after treatment. 
The content of irnmunodetectable CYP I A I protein in liver was elevated on Day I and 
continued to increase through 14 days. CYPlAl protein content was also strongly 
induced in heart and gill beginning at 2 days after treatment and extending through Day 
14. Hepatic CYPlAl mRNA was strongly induced by 1 day after dosing and remained 
elevated through 14 days. The sustained induction of CYPlAl mRNA by 2,3,7,8-TCDF 
contrasts with the transient induction seen previously in fish treated with aonhalogenated 
inducers and most likely reflects differences in persistence of the inducers. Dose-resporise 
studies indicated that induction of CYPlAl mRNA, protein, and catalytic activity 
occurred following doses of 2,3,7,8-TCDF as low as 0.4 nmol/kg (120 ng/kg), within the 
range of whole : body contents of this congener measured in fish from contaminated 
environments. The estimated dose producing hal f-maximal CYP. 1 A 1 induction in scup 
was approximately 2-10 nmol/kg, suggesting that the sensitivity of these fish to induction 
may be as great as or greater than that of rats. In contrast to previous results obtained with 
S^^^'-tetrachlorobiphenyl (TCB) and beta-naphthoflavone, which appear to inhibit or 
inactivate CYPlAl in fish and other vertebrates, there was a good correlation among 
levels of CYPlAl mRNA, protein, and catalytic activity in individual fish following 
various doses of 2,3,7,8-TCDF. The difference in response to 2,3,7,8-TCDF versus 
3,3\4,4'-TCB may reflect differences in the inducing potencies of the two compounds 
relative to their similar potencies as inhibitors of CYPlAl catalytic activity. In additional 
studies to evaluate structure-activity relationships for CYPlAl induction by chlorinated 
dibenzofurans in fish, scup were treated with 2,3,6,8-tetrachlbrodibenzofuran (2,3,6,8- 
TCDF). At 10 or 50 nmol/kg, 2,3,6,8-TCDF was inactive as an inducer of CYPlAl 
mRNA, protein, or catalytic activity.(ABSTRACT TRUNCATED AT 400 WORDS) 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA 
analysis and pS2-protein radioimmunoassay. 

Hahnel E, Robbins P, Harvey j; Sterrett G , Hahnel R . 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical 
Centre, Nedlands. 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-protein 
using a radioimmunoassay, and by determining pS2 : mRNA using Northern blotting. 
There was a good correlation between the two measurements, indicating that expression 
of the pS2 gene in breast tissues may be assessed by either method. Since 
radioimmunoassay is technically easier and more efficient than Northern blotting, 
radioimmunoassay will be the method of choice in routine applications. 
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Brief communication 

Expression of the pS2 gene in breast tissues assessed by pS2-mRNA analysis 
and pS2-protein radioimmunoassay 



Eriica Hahnef, Peter Robbins, Jennet Harvey, Gregory Sterrett and Roland Hahnel - 
Department of Pathology, University of Western Australia, Queen Elizabeth JJ Medical Centre, Nedlands, 
6009, Western Australia 



Key words: breast tissue, pS2-mRNA, pS2 protein, radioimmunoassay 
Summary . 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-prorein using a radioimmu- 
noassay, and by determining pS2-mRNA using Northern blotting. There was a good correlation between the 
two measurements, indicating that expression of the pS2 gene in breast tissues rriay be assessed by either* 
method. Since radioimmunoassay is technically easier and more efficient than Northern blotting, radioim- 
munoassay will be the method of choice in routine applications. 



Introduction 

Expression of the pS2 gene is controlled by estro- 
gen. This was first described in the MCF-7 breast 
cancer cell line []). pS2 expression has since been 
reported to be useful as a prognostic indicator f2, 
3], although this was not confirmed in another se- 
ries {4]. t 

pS2 expression may be assessed in tissue ho- 
mogenates by analysis of pS2-mRNA [5], by ra- 
dioimmunoassay of the pS2-protein [2], or by im- 
munocytochemica) detection of the pS2 protein in 
tissue secuons [5], It was the aim of this study to 
establish the correlation between pS2-mRNA and 
pS2-protein by radioimmunoassay in a series of 
tissues obtained from mastectomy specimens per- 
formed for carcinoma of the breast. Primary breast 
carcinoma tissue, metastatic carcinoma within ax- 
illary Dodes, and macroscopicaJJy benign bxeasx 
tissue were examined. 



Materials and methods 



Breast tissues 



Tissue specimens from mastectomies performed 
for carcinoma of the breast were examined. 32 pri- 
mary breast carcinomas, 10 axillary lymph nodes 
containing metastatic breast carcinoma, and 20 
samples of uninvolved breast tissue were analyzed 
for pS2 expression. 

The primary breast carcinomas were histolog- 
ically classified using a conventional subclassifi- 
cation. The presence or absence of primary tumour 
was assessed. The presence of metastatic carcino- 
ma within lymph nodes studied was verified by 
histological examination of the node remnant after 
sampling. 

'Uninvolved* breast tissue was sampled from 
sites welJ removed from the primary breast tumour 
(usually in another quadrant of the breast), and was 
selected only if the tissue appeared macroscppical- 
Jy unremarkable. Tissue sampling occurred irrime- 
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diately upon anival of the mastectomy specimen in 
the laboratory, with minimal delays between re- 
moval and sampling. 

Tissues for pS2 analysis were snap frozen in liq- 
uid nitrogen and stored at ~70°C until prQcessed. 

Extraction ofRNA and determination of 
pS2-mRNA 

Details of the procedure have been described in pur 
previous paper [6^ Briefly, the deep-frozen tissue 
was homogenized in a micro-dismembraton The 
homogeneous powder, was extracted with guani* 
diniuxmsothiocyanatephenolchloroformiso- 
amyl alcohol, and RNA was precipitated with iso- 
propanol. The washed RNA pellet was dissolved in 
SDS and glyoxylated, and the RNA preparation 
loaded onto agarose gel After electrophoresis the 
gel was capillary blotted onto Zeta-probe mem- 
branes. Membranes were hybridized overnight 
• ., wi* cDNA.probes pS2 and 36B4, which were la- 
Jtk^elcd with [a*P] dCTP by nick translations, 
^P^Vashed membranes were exposed to Kodak X- 
omat AR film. Relative intensities of the mRNA 
bands were assessed visually as not detectable, ve- 
ry weak, weak, medium, strong, and very strong, 
taking the intensities of the ubiquitous 36B4 bands 
into account. 



Radioimmunoassay of pS2-protein 

Deep frozen specimens were pulverized with a mi- 
crodismembrator. The tissue powder was suspend- 
ed in 10 volumes of pH7.5 phosphate buffer. The 
homogenate was cenirifugedin a refrigerated cen- 
trifuge at 4°C for 60 minutes at 2600xg. The su- 
pernatant was removed with a Pasteur pipette, 
carefully avoiding the fat layer on the top. The pro- 
tein concentration in the supernatant was estimated 
by use of the Coomassie dye-binding method [7J. 
An aliquot of the supernatant was diluted to a pro- 
tein concentration between J and 2mg/ml before 
assay of tbe pS2-protein. In one case the protein 
concentration of the supernatant was well below 
Img/ml. 




The estimation of the pS2-protein was perform- 
ed using a solid phase, two-site radioimmunoas- 
say The kits were bought from CIS Biointema- 
tional, Gif-sur-Yvette, France (ELSA-PS2). In this 
method the molecules of pS2 are sandwiched be- 
tween two monoclonal antibodies; tbe first one is 
coated on the ELS A solid phase, the second one is 
radiolabeled with J25-iodine. The radioactivity 
bound to the ELSA is proportional to tbe concen- 
tration of pS2-protein. Details of tbe procedure are 
supplied with the kit {8] " 

Results and discussion 

32 primary , breast carcinomas, metastatic breast 
carcinoma in 10 lymph nodes, and 20 samples of 
benign breast tissue from mastectomies were in- 
vestigated. Two of the carcinomas were of the infil- 
trating lobular type, two were ductal carcinomas in 
situ, one was a multicentric invasive ductal carci- 
noma, all others were invasive ductal carcinomas. 

Examples of pS2 Northern blots have been 
shown in our previous paper [6] which demon- 
strate that undegraded pS2.mRNA can be isolated 
by tbe method used. 

The results of the pS2-protein and pS2-mRNA 
assays are shown on Kg. 1. There was a good corre- 
lation betweeiuhe two types of results. When pS2- 
mRNA could not be delected by Northern blot, 
pS2-protein results were usually below ]ng/mg 
protein (22 of 30), or between J and 3.7ng/mg (6 of 
30). Two were exceptions (7.7 and J4.6); one of 
them could have been due to the very low protein 
content in the cy tosol which would lead to a large 
pS2 value and an associated error. There was no 
explanation for the other high result. Very weak 
pS2-mRNA sigmls on Northern blots correspond- 
ed to pS2-protein values between 1.1 and 19.2 with 
an average of 6.6ng/mg protein (median 5.7); The 
mean and median pS2-protein concentration in the 
tissue with weak pS2-mRNA signals were 14:3 and 
10.7ng/mg protein, respectively. The average pS2~ 
protein concentration increased to 32.7 (median 
31.5) ng/mg protein for tissues assessed as medium 
pS2-mRNA intensity, and to 43.3 (median 53.8) 
n g/n>g protein for tissues with strong or very strong 
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pS2-mRNA signals. These values should be used 
as an approximate guide only, since the number of 
samples in the various groups was fairly small. 
One-way analysis of variance confirmed that the 
means, of the pS2-protein values in the groups 
made up according to their pS2-mRNA signal in- 
tensity, were significantly different (p <ltH). 

If the pS2 gene is expressed, its expression is on 
average greater in breast carcinomas than in unin- 
volved breast tissue. If one takes pS2-protein val- 
ues above 4ng/mg protein as eut-off, the average 
pS2-protein in J4 breast cancers was 343 (median 
• (35.2), while it was only 18.1 (median 13.8) in 12 
uninvolved breast tissue samples.. If the cut-off is 
taken at )dng/mg protein, average pS2-protein in 
breast carcinoma is also about twice the level of 
uninvolved tissue. There were not enough lymph 
node metastases which expressed the pS2 gene to 
allow a comparison with carcinoma or uninvolved 
breast tissue. , 

Recent preliminary results of pS2 by radioim- 
munoassay [91 are similar to ours for breast cancer 
but considerably lower than our results for normal 
breast tissue. 

The values of the pS2-protein measured obvi- 
ously depend on the protein used for calibration. 
We used the pS2-protein standards supplied with 
the CIS kit, which according to the supplier gave 
values from 0 to 740ng/mg protein in a series of 
205 breast cancer cytosols. Previously, a different 
standard had been used for presumably the same 
series of breast carcinomas [2], and a conversion 
factor to current standards is given as 2.8 [8]. 

It was noticed that the correlation between pS2- 
protein and pS2-mRNA was better in breast carci- 
noma specimens than in uninvolved breast tissue. 
This is unexplained, though it could be due to the * 
variable content of cell or tissue types in adjoining 
pans of a specimen, a variation more likely to oc- 
cur in our sampling of non-malignani breast tissue 
compared to sampling of carcinomas. A similar 
variability in breast carcinoma specimens will 
probably have a smaller influence on the pS2 re- 
sults, since the malignant cells - if they do express 
the pS2 gene - contain more pS2-protein than nor- 
mal breast. 
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pS2-mRNA 

Fig. A Con etetion between pS2-protein by radioimmunoassay and 
p$2-mRNA by Northern bloL 0 = 22 results beJow KThe horizontal 
lines indicate the mean values. 
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The role of the epidermal growth factor receptor in sustaining neutrophil 
inflammation in severe asthma. 
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BACKGROUND: The extent of epithelial injury in asthma is reflected by expression of 
the epidermal growth factor receptor (EGFR), which is increased in proportion to disease 
severity and is corticosteroid refractory. Although the EGFR is involved in epithelial 
growth and differentiation, it is unknown whether it also contributes to the inflammatory 
response in asthma. OBJECTIVES: Because severe asthma is characterized by 
neutrophilic inflammation, we investigated the relationship between EGFR activation and 
production of IL-8 and macrophage inhibitory protein-1 alpha (MIP-1 alpha) using in 
vitro culture models and examined the association between epithelial expression of IL-8 
and EGFR in bronchial biopsies from asthmatic subjects. METHODS: H292 or primary 
bronchial epithelial cells were exposed to EGF or H202 to achieve ligand -dependent and 
ligand-independent EGFR activation; IL-8 mRNA was measured by real-time PCR and 
IL-8 and MlP-lalpha protein measured by enzyme-linked immunosorbent assay 
(ELIS A). Epithelial IL-8 and EGFR expression in bronchial biopsies from asthmatic 
subjects was examined by immnnohistochemistry and quantified by image analysis. 
RESULTS: Using H292 cells, EGF and H202 increased IL-8 gene expression and 
release and d\is was completely suppressed by the EGFR-selective tyrosine kinase 
inhibitor, AG 1 478, but only partially by dexamethasone. MIP-1 alpha release was not 
stimulated by EGF, whereas H202 caused a 1.8 -fold increase and this was insensitive to 
AG1478. EGF also significantly stimulated IL-8 release from asthmatic or normal 
primary epithelial cell cultures established from bronchial brushings. In bronchial 
biopsies, epithelial IL-8, MDP-1 alpha, EGFR and submucosal neutrophils were all 
significantly increased in severe compared to mild disease and there was a, strong 
correlation between EGFR and IL-8 expression (r = 0.70, P < 0.001). CONCLUSIONS: 
These results suggest that in severe asthma, epithelial damage has the potential to 
. contribute to neutrophilic inflammation through enhanced production of IL-8 via EGFR- 
dependent mechanisms. 
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polymorphic expression. 
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Meriiidividual variation in the expression of human microsomal epoxide hydrolase 
(mEH) may be an important risk factor for chemically induced toxLties mdudhL 
cancer and teratogenesis. In this study, phenotypic viability and mEH genetic 
j£S£S^r^ " 3 - ° f 40 ^-^y humanTv^ampLes. 

Messed uX ^ *** ewUuated in paralIeL En zymatic activity was 
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^4-fold vanation determined for mEH protein content within the same JpS 

were poorly correlated (7<o^3) with 

«Tfl3E5?2? P T exc,us 'vely account for variation in enzymatic 

activity, although tfus conclusion is confounded by heterozygosity in me^amnles These 
^demonstrate the extent of hepatic mEH functional variability m wetpSv^ 
human tissues and suggest that polymorphism of mEH protein expression h 3ld in 
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Abstract 

Lung cancer is Ihe most common cause of cancer death 
in developed countries. The prognosis is poor, with less than 
J5% of patients surviving 5 years after diagnosis. The poor 
prognosis is attributable to lack of efficient diagnostic meth- 
ods for early detection and lack of successful treat nun I for 
metastatic disease. Most patients (>75%) present with stage 
1)1 or JV disease and are rarely curable with current ther- 
apies. Within the last decade, rapid advances in molecular 
biology, pathology, bronchology, and radiology have pro- 
vided a rational basis for improving outcome. These ad- 
vancements have led to a better documentation of morpho- 
logical changes in the bronchial epithelium before development 
of clinical evident invasive carcinomas. This has changed our 
concept of lung carcinogenesis and emphasized the multislep 
carcinogenesis approach on several levels. Combined with 
Ihe technical developments in bronchoscope techniques, 
e.g., laser-induced fluorescence endoscope (IJFE) bronchos- 
copy, we now have improved methods to localize preinvasive 
and early-invasive bronchial lesions. With the LIFE bron- 
choscope, a new morphological entity (angiogenic squamous 
dysplasia) has been recognized, which might be an impor- 
tant biomarker and target for antiangtogenic rhemopreven- 
tive agents. To reduce Ihe mortality of lung cancer, these 
new technologies have been taken into Ihe clinic in different 
scientific settings. The use of low- dose spiral computed to- 
mography in Ihe screening of a high-risk population has 
demonstrated Ihe possibility of diagnosing small peripheral 
tumors that are not seen on conventional X-ray. A shift in 
the therapeutic paradigm from targeting advanced clinically 
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manifest lung cancer toward asymptomatic preinvasive and 
early- invasive cancer is occurring. The present article re- 
views the recent advances in the diagnosis of preinvasive and 
early-invasive cancer to identify biomarkers for early detec- 
tion of lung cancer and for chemoprevention studies. 

Introduction 

Lung cancer is the mosl common cause of cancer deaths in 
the countries of North America and other developed countries, 
accounting for 29% of all cancer deaths and more deaths than 
from prostate, breast, and colorectal cancer combined in the 
United States (I). Lung cancer, will be diagnosed in -170,000 
new patients in the United States in the year 2000, and < 1 5% of 
them will survive 5 years after diagnosis (1). The prognosis for 
the patients with lung cancer is strongly correlated to the stage 
of the disease at the time of diagnosis. Whereas patienls with 
clinical stage I A disease have a 5-year survival of about 60%, 
the clinical stage H-1V disease 5-year survival rate ranges from 
40% to less than 5% {2). Over two- thirds of the patients have 
regional lymph-node involvement or distant disease at the lime 
of presentation (3). The poor prognosis is largely attributable to 
the Jack of effective early detection methods and the inability to 
cure metastatic disease. The unsatisfactory cure rates supports 
efforts aimed at early identification and intervention in lung 
cancer. 

Historically, the only diagnostic tests available for the 
detection of lung cancer in its early stages were chest radiogra- 
phy and sputum cytology. The efficacy of these tests as mass 
screening tools was evaluated in controlled trials sponsored by 
the NCI 3 and conducted at Johns Hopkins University , Memorial 
Sldan-Kctlcring Cancer Center, and the Mayo Clinic during the 
1970s (4-6). The principal goal of these studies was to deter- 
mine whether a i eduction in lung cancer mortality could be 
achieved by adding sputum cytology testing to annual screening 
by chest radiography. Results from these trials showed that both 
tests could detect piesympiomotic, eaily-siage carcinoma, par- 
ticularly ol squamous cell type. Kesectabiliry and survival rates 
were found to be genet ally higher in the study groups than in the 
control groups. However, improvements in readability and 
survival did noi lend to a reduction in overall lung cancer 
mortality, ihe most critical end point. A subsequent snidy ol 
6346 Chechoslovakian male smokers also found no i eduction in 
him; cancer mortality after dual screening by chesi radiography 
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and sputum cytology (7). The negative results from these 
'■ screening studies lead the NCI and other health policy and 
research groups to conclude that mass screening programs in- 
volving periodic sputum cytological evaluation and chest radio- 
graphs could not be justified. However, cbntroversies in the 
methodology and interpretation of the data from these studies 
have later been extensively discussed (8, 9). One additional 
study of annual chest X-ray screening is currently being con- 
ducted by the NCI; The Prostate-, Lung-, Colorectal-, and Ovar- 
ian (PLCO) screening trial. This trial includes individuals 55-74 
years old, but they are not selected for this trial on the basis of 
high risk for lung cancer (e.g. r smoking history with >20 
pack-years). 

The failure of clinical trials to demonstrate the efficacy of 
sputum cytology and chest radiography as mass screening tools 
has resulted in a search for better diagnostic approaches for 
early lung cancer detection that take advantage of recent devel- 
opments in molecular biology, gene technology, and radiology 
(10). Furthermore, as has been the case for mammography 
screening for breast cancer, it has also been important to identify 
risk groups for lung cancer. 

Although, much is known about predisposing factors, nat-. 
ural history, and the outcome based on histology and stage, our 
understanding remains very incomplete in many areas. What are . 
the early premalignant changes molecularly, biochemically, and 
• morphologically? Which changes are reversible and which are 
not? What research tools are available to provide answers to 
these questions? The identification of preinvasive lesions allows 
for developing promising methods for early intervention (JJ). 
The therapeutic paradigm and focus are today shifting from 
targeting only clinically verified Jung cancer as previously to- 
ward targeting. the premalignant and early- malignant lesions. 
Furthermore.' the prospect of lung cancer screening has today 
become more meaningful as a consequence of recent develop- 
ments in biology and radiology and better possibilities to define 
high-risk populations most suitable for lune cancer screening 
02). ^ 

The present article will focus on the clinical perspectives of 
our biological knowledge of premalignant and early- malignant 
lesions and the potential of the recent technological advance- 
ment for early diagnosis of lung cancer. 

Pathology of Preinvasive and Early Invasive 
Bronchial Lesions 

Most of the efforts to classify lung cancer have been 
directed toward invasive carcinoma ()3). However, better un- 
derstanding of the pathogenesis of lung cancer aroused renewed 
interest in morphological abnormalities that fall short of inva- 
sive carcinoma but may indicate initiation of carcinogenesis. 
These moiphological abnoimalities are referred to as preinva- 
sive lesions and are shown in Fig. I. 7 he last edition of the 
WHO classification of lung tumors included the classification ol 
preinvasive lesions as a separate section. Numerous recent stud- 
ies have indicated that lung cancel is not the result of a sudden 
transforming cent m die bronchial epithelium but a mulristcp 
process in which gradually accruing sequential genet if and 
cellular changes result in the fori nation of an invasive (:.e.. 
malignant) tumor Mucosal changes in the la ice airways that 



may precede or accompany invasive squamous carcinoma in- 
clude hyperplasia, metaplasia, dysplasia, and'CIS (14); Hyper- 
plasia of the bronchial epithelium and squamous metaplasia 
have generally been considered reversible, and not premalignant 
. in the sense of squamous dysplasia and CIS (15). 

Squamous metaplasia is a common finding, especially as a 
response to cigarette smoking. Peters et oL (16) studied bron- 
choscope biopsies from six sites in 106 heavy cigarette smok- 
ers; Squamous metaplasia was noted at one or more biopsy sites 
in approximately two- thirds of the group, and one- fourth 
showed squamous metaplasia in three or more biopsy sites. The 
incidence of squamous metaplasia increased with smoking his- 
tory and was highest in individuals who had smoked more than 
two packs of cigarettes a day. Auerbach et al (J7) noted similar 
. findings in autopsy tissues: basal cell hyperplasia and squamous 
metaplasia are increased in smokers in proportion to smoking 
history. Hyperplasia and metaplasia are believed to be reactive 
changes in the bronchial epithelium, as opposed to true preneo- 
plastic changes (17, 18). The reasons for this include: (a) they 
are frequently found in association with chronic inflammation, 
and may be induced by mechanical trauma; (b) they spontane- 
ously regress after smoking cessation; (c) in chronic smokers, 
the molecular changes present in these lesions are similar to 
those present in histologically normal epithelium; and (d) there 
are no reports linking their presence to increased risk for devel- 
oping lung cancer. In contrast, moderate-to-severe dysplasia and 
CIS lesions seldom i egress after smoking cessation (19). 

Dvsplasia and CIS are changes that frequently precede 
squamous cell carcinoma of the lung. Saccomanno et ai (20) 
studied more than 50,000 samples from 6,000 men. many of 
whom had worked in the uranium mining industry. Both smok- 
ing and uranium mining (radon exposure) .were found to be 
associated with increased incidence of dysplasia, CIS, and in- 
vasive cancer. The studies of Saccomanno el al. established that 
increasing degrees of sputum atypia may be recogni2ed an 
average of 4-5 years before the development of frank lung 
carcinoma. 

Another question is: which grades of sputum atypia pro- 
gress, to cancer? Fiom the Johns Hopkins cohort of the NCI 
chest X-ray/sputum screening trial. we know that among indi- 
viduals with moderate arypia on sputum screening. -10% de- 
veloped known cancer up to 9 years later. Among individuals 
with severe arypia on the sputum screening. >40% developed 
known cancct during the same time period (21). Although there 
are data in the literature showing the relationship between 
sputum arypia ami subsequent invasive cancer, there is slrll very ■ 
little information about the histological progression in the bron- 
chial mucosa in the high risk populations. In a recent publica- 
tion, rune patients with CIS were followed with autofluorcs- 
cence bronchoscopy at legular intervals, and 5 (56%) had 
propr ess ion to invasive cancer despite endobronchial therapy 
(??) "Mr- number of invasive cancers might even have been 
higher if treatment had not been not given. Ongoing studies of 
high-risk subjects (ri;„ the Coloindo sputum cohort study) 
including senal follow- up bronchoscopies w ill provide evidence 
related to the frequency o( development of invasive lung cancer 
as n lelaufs n> smoking history-, airflow obs miction, and sputum 
atypia 

Since the previous WHO- classification was published m 
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Fig. I A t squamous metaplasia. The 
cells are widely dispersed, wilh a reg- 
ular maturation from ibe basal region 
to ihe top. There is keratin 123 1 ion, and 
the nucJei/cy*oplasmic ratio is low. B, 
moderate dysplasia with ASD. Hyper- 
celrulariry of the epithelium wilh in- 
complete maturation and micropapil- 
lary invasion of capillaries are seen. 
The nuclei/cytoplasmic ratio is high. 
C, severe dysplasia. There is marked 
pleomorphism of ihe cells with irregu- 
larity and prominent nucleoli. 
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J 981, two nonsquamous lesions have been added lo ibe WHO 
classification of premalignant lesions: atypical alveolar hyper- 
plasia and diffuse idiopathic neuroendocrine cell hyperplasia 
(13). Both of these lesions are diagnosed rarely. The former 
consists of lesions <5 mm in diameter and composed of a 
peripheral epithelial cell proliferation with minimal cytological 
atypia or stromal response and resembles bronchioloalveolar 
carcinoma! The lesion has been seen »n hirig specimens resected 
for lung cancer, but no prospective significance, of this lesion 
has been reported. However, this morphological lesion may play 
a role for the pathogenesis of peripheral Jung adenocarcinomas 
(23, 24). The resolution of spiral CT (currently about 3 mm) 
approaches the diameter of these lesions, and it is anticipated 
that atypical alveolar hyperplasia will be increasingly encoun- 
tered in subjects undergoing this procedure (25). Diffuse idio- 
pathic neuroendocrine cell hyperplasia consists of a patchy 
increase in the number of well-differentiated neuroendocrine 
cells in the bronchioles. This process may result in the formation 
of small carcinoid tumors, and for this reason it is considered 
. "preinvasive." To date, small cell carcinomas have not been 
associated with this lesion (13). 

Recently, the use of fluorescence bronchoscopy (see be- 
low) has increased the recognition of dysplastic lesions in ; Ibe 
large airways and a new morphological entity, ASD, was iden- 
tified (26). Dysplasia of bronchial epithelium in "micropapillo- 
matosis" and the possible link between angiogenesis and prein- 
. vasive bronchial epithelial dysplasia were recognized as early as 
1983 by MuIIer and Muller (27); who also described the ultra- 
structure of these lesions. Jl has been suggested that ibis angio- 
genesis, which is recognized as capillary loops projecting into 
the dysplastic bronchial lining, is responsible for the reduced 
fluorescence seen in dysplastic lesions by LIFE bronchoscopes 
(Tigs. 1 and 3; Rcf 26). Funue prospective studies will show 
whether this morphological enriry is correlated with a progres- 
sion to lung cancer so as to be a target for the use of arrtiangio- 
genic agents for chemoprcvention. 

In general, there are several questions/problems relating to 
premalignant lesions, which will be addressed in future studies: 
(a) The morphological criteria for premalignant and early- 
malignant changes, both on sputum cytology and in bronchial 
biopsies, have to be validated for intra- and inter observer repro- 
ducibility. 

{b) Unifotm and repiodncrble morphological/cytologies J 
criteria have to be published more extensively, and o rminim: set 
of slides should be available. By the use of Internet technology, 
this could be mote easily facilitated (78). 

(c) The correlation of sputum atypia and hist olor.icaj 
changes in (he bionchi in high-risk population is not well 
defined. 

id) The natural course ol preinvasive changes in the bron- 
chi from the high risk subjects needs to be clarified through 
longitudinal, prospective studies with reference to histological 
chances in the bronchi. Ongoing longitudinal studies with flu- 
orescence bronchoscopy ami multiple biopsies with hisMogv 
and other bioinarkers will define the ability of these markers to 
assess for risk. 

{e) What is the pathology ^mlf-gv of ihe small, often pe- 
ripherally located, mmois f3 nun rn diameter 1. which aie mote 



often diagnosed with newer radiological techniques (e.g., low- 
dose spiral CT}? 

(/) Optimization of the tissue procurement and processing 
techniques are important. Distinction of reactive from neoplastic 
processes is usually straightforward, but diagnostic difficulties 
may arise in the case of (a) inadequate or poorly prepared 
histological materia] to evaluate and {b) the presence of cyto- 
logical atypia in epithelium, stimulated by inflammation, viral 
infection, radiation, or chemotherapy. 

(g) DNA array analyses of gene expression; will it be 
useful? How to collect proper mRNA? Can mRNA extracted 
from microdisseeled cells obtained at bronchoscopy be globally 
amplified and still remain representative of RNA present in siftt? 

Biology of Lung Carcinogenesis and Potential Early 
Detection Markers. 

Tung cancer is the end- stage of multiple-step carcinogen- 
esis, in most cases driven by genetic and epi gene tic damage 
caused by chronic exposure to tobacco carcinogens. The genetic 
instability in human cancers appears to exist at two levels: at the 
chromosomal level, including large scale losses and gains; and 
at the nucleotide level including single or several base changes 
(29). Lung cancers harbor many numerical chromosomal abnor- 
malities (aneuploidy) and structural cytogenetic abnormalities 
including deletions and nonreciprocal translocations (30). At 
least three classes of cellular genes are involved: proto- onco- 
genes, TSGs, and DNA repair genes. Oncogenic activation often 
occurs via point mutations, gene amplification, or chromosomal 
rearrangement, whereas TSGs are classically inactivated by the 
loss of one parental allele combined with a point or small 
mutation or aberrant methylation of a target TSG in the remain- 
ing allele. Additionally, dysregulated gene expression (either 
increased or decreased expression) can occur by other, as yet 
unknown, mechanisms (30). Present studies have not yet con- 
finned a prominent role for abnormalities of DNA repair genes 
in Jung cancer. 

Preneoplastic cells contain several molecular genetic ab 
normalities identical to some of the abnormalities found in overt 
lung cancer cells (Fig. 2). These include allele loss at several 
loci {3p, 9p. t>p, and }7p), myc and ras up-regulation, cyclin Dl 
overexpression, p53 mutations, 'and increased immunoreactiv- 
iiy, bc!-2 overexpression and DNA aneuploidy (3J-35). AJIelo- 
typing of precisely miciodissectcd,. preneoplastic foci of cells 
suggests that the earliest changes in Ihe bronchial epithelium is 
allele Joss at chromosome regions 3p, then 9p, Sp, I7p. 5q. and 
then ras mutations (36-39).Thc biological meaning of LOH is 
only vaguely understood. Recent evidence suggests (hat I. OH 
may be a consequence of mitotic recombination, that there is 
only infrequent physical loss of genetic loci, and thai l.OH 
probably precedes chromosomal duplicatron HO). Allelic loss 
would thus be significant primarily in the presence of mutation 
in the retained allele, and gene dosage would not be expected to 
exert a pbenoiypic effect in LOH. Some reports have indicated 
thai rus actuation occurs at early carcinoma stages (3<l). His- 
lulngically normal bronchial epithelium adjacent to cancers has 
also been shown to have certain genetic tosses. Atypical :ide- 
noinaious hyperplasia, the potential precursor lesion ol adeno- 
cnicinomris. often have Ki-/ym mnirmons HI). 
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Fig. 2 Sequential changes during lung cancer pathogenesis. Although multiple genetic markers ore abnormal in lung cancers, their appearance during 
the lengthy preneoplastic process varies. The timing of the appearance of these changes has been investigated in bronchial prcneoplasia, because 
sequential sampling of the peripheral lung is technically difficult. Several alterations have been described in histologically normal bronchial epithelium 
of smokers. Other changes have been detected in slightly abnormal epithelium {hyperplasia, metaplasia) which we do not consider to be inre 
premalienant lesions. These changes are regarded as early changes. Molecular changes delected frequently in dysplasia are regarded as intermediate 
in timing, whereas those usually detected at the CIS or invasive stages are regarded as lale changes. Il should be stressed (hat although there is a usual 
order, exceptions regarding the timing of onset may occur. Some changes are progressive, such as chromosome 3p deletions. Thus small disci ete 
changes are present early, progressively become more extensive during pathogenesis, and frequently involve all or almost alt of the arm in CJS 
samples. Although allelic loss at the TP53 locus may precede the onset of mutations, data on this sequence are scant. Dysregulation of the RNA 
component of telomerase (with its appearance in noobasal cells) is an early event, whereas up-regulation of the gene is a relatively late event. 
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Molecular changes have been found nol only in the Junes 
of patients with lung cancer, but also in the lungs of current and 
former smokers without lung cancer (J8 r 42. 43). These obser- 
vations are consistent with the multistep model of carcinogen- 
esis and "field cnnceriz-iiion*' process, whereby the whole region 
is repeatedly exposed to carcmogcnic damage (tobacco smoke) 
and is at risk for developing multiple, separate, clonal I v unre- 
lated foci of neoplasia. The widespread ancuploidy that occujs 
throughout the respirator) tree of smokers supports (his theory 
(•14). However, the presence of the same somatic p53 point 
mutation at widely dispersed preneoplastic lesions in a smoker 
without invasive lung cancer indicates that expansion of a single 
progenitor eh_>ne may spread throughout the respii.itorv tree 
(•IS). These molccntnr alterations might thus be important 
targets lot use m the early detection of I uric cancer and lot use 
as surrogate biomarkers in the follow-up of chemopicvcniron 



studies. Detection of these mutant cells should be possible with 
the different molecular techniques in accessible specimens. The 
prospects of diagnosing these biological abnormalities in mul- 
tiple types of clinical specimens are discussed below. 

Specimens for Clinical Testing; Sputum 

Since the 1930s, cyiological examination of sputum has 
been used for the diagnosis of lung cancer [AO). Cytologic a I 
examination of sputa, especially multiple samples, is helpful for 
the detection of central tumors arising from the larger bronchi 
{e.g. squamous celt- and small cell carcinomas). H.xfoliated 
cells I torn peripheral tumors, such as adenocarcinomas, ansim: 
Irom the smaller airways (small bronchi, bronchioles, and alve- 
oli), especially those less than 2 cm m diameter, can be deteefrd 
only occasionally in sputum samples. This has become <d 
gi eater importance* because the changes in cigaictit* to. position 



JO Review: Advances in Early Detection of Lung Cancer 



(filters and decreased nicotine content) have created an increase 
in adenocarcinomas and a decrease in squamous carcinomas 
(47-49). The sensitivity of sputum cytology for early lung 
cancer is only in the 2O%-30% range from screening studies, 
but by adhering to proper specimen collection, and processing 
and interpreting criteria, the yield can be substantially improved 
(50, 51). The data on the reliability of the sputum are conflicting 
(52-54). Browman et al (52) reported interobserver agreement 
of 68% for exact and 82% for within - "J- category. HoJJiday et 
al (54) reported low agreement within observers (27- 60%) and 
across observers (13-50%). Within - 1 - category intraobserver 
agreement underwent a two- or 3- fold increase in agreement, 
which was also the case for interobserver agreement. The var- 
iation in intra- and interobserver agreement seems to depend on 
experience among the cytotechnicians/cytopatbologists and the 
composition of categories studied. A higher degree of agreement 
is obtained for higher grades of dysplasia (54). Risse et al (55) 
showed that the ability to detect premalignanl conditions is 
dependent on the number and type of cells present in the deeper 
airways, suggesting a mode of improvement that is unrelated to 
observer reliability. M3cDbugalI et al (56) concluded that spu- 
tum cytology was too insensitive and insufficiently accurate to 
be included in the routine work-up of any patient suspected of 
having lung cancer. To improve the reliability of sputum cytol- 
ogy examinations a simplification of the diagnostic categories 
from .6 (normal; squamous metaplasia; mild, moderate, and 
severe atypia; and carcinoma) to 2-3 categories have been 
proposed (54). Future clinicopalhological studies will be re- 
quired to validate this concept. 

To improve the sensitivity of sputum examination as a 
population-screening tool for the detection of early lung cancer, 
several approaches are currently under development. 

Jmmnnosloining. Annual sputum specimens obtained 
from individuals screened at Johns Hopkins were obtained, and 
ihe patients were monitored for 8 years (57). Because the 
clinical outcome of these patients was known, archival sputum 
specimens were screened for the presence of biomarkers that 
could indicate the presence of lung tumors in an early, preinva- 
sive stage. In an attempt to distinguish the pattern of marker 
expression Tockman et al (58) studied two monoclonal anti- 
bodies. Positive staining predicted subsequent lung cancer ap- 
proximately 2 years' before clinical recognition of the disease, 
with a sensitivity of 91% and n specificity of S8% (58). One of 
these antibodies (703 D4) had a higher sensitivity rmd was later 
identified ns recognizing hnRNP A 2/1)1 (59). The 'role of 
hnRNP A2/B1 over expression for detecting preclinical lung 
cancer has been studied in a large high risk population including 
• 6000 Chinese tin miners who were heavy smoke is and who had 
an extraordinary talc of lung cancer t60). 7 he results from, this 
study indicated thai detection of hnRNP A 2/ 13 1 over ex pi ess ion 
in sputum epithelium cells was 2- to 3- fold rnoie sensitive for 
detection of lung cancer than standard chest X-ray and sputum 
cytology methods. The method was particularly effective in 
identifying early drscase (60) The sensitivity was 7-t% r.v.*iu 
?)% for cytology and 42% for chest X-ray. H.y.wve,. die 
biomaikcr bad a kmei specificity [70% r compared ujth otol- 
ogy (J 00%) and chest radiograph (90Vi. An ongoing clinical 
trial is evaluating ihe per form ;mcc ot the A21.U protein ns a 
biornaikei for the early detection of SPI.C "Ihe patients at ii : .k 



for SPLC have the highest incidence of lung cancer (2-5%) 
among asymptomatic populations (61). In this trial, 13 SPLCs 
were identified by A2/BI, and the sensitivity and specificity 
were 77-82% arid 65-81%, respectively. Among the cases 
. identified as positive by immujiocylocherru'srry and image cy- 
tometry, 67% developed SPLC within I year (62). Whereas the 
previous immurtocytochemistry studies on material from Ihe 
older screening material from the NCI-supported screening 
studies were made on sputum cells cytologically classified with 
moderately or gravely atypical metaplastic appearance, the latter 
studies have been done on cytologically "normal appearing" 
cells. More recently Sueoka et al (63) reported the confirmation 
of the value of overexpression of hnRNP A2/BI to delect 
preclinical lung cancer in Japan. Efforts to improve the sensi- 
tivity of hnRNP markers are ongoing (64). 

PCR Techniques. PCR techniques have been used for 
the evaluation of molecular biomarkers for early rung cancer 
detection. In a pilot study with selected patients from the Johns 
Hopkins Lung Project (JHLP), 8 (53%) of 15 patients with 
adenocarcinoma or large cell carcinoma were detected by mu- 
tations in sputum cells from 1 to 13 months before clinical 
diagnosis (65). However, the method seemed to be less sensitive 
than the protein marker described above, and the identification 
of specific gene abnormalities is further limited by the need to 
know the specific mutation sequence with which to probe the 
sputum specimens. Currently, this approach is not practical for 
screening undiagnosed individuals. Future advances in gene 
chip technology may permit testing for all possible mutations of 
common oncogenes and TSGs in clinical specimens of asymp- 
tomatic individuals (62). 

Microsatellile markers are small repealing DNA sequences 
found in the noncoding regions of a gene. PCR amplification of 
these repeat sequences provides a rapid method for assessment 
of LO>l and facilitates the mapping of suppressor genes (66, 
67). MicrosDteJlile alterations are extension or deletions of these 
repeated elements. Detection of microsalellite alterations in 
histological or cylological specimens may facilitate the detec- 
tion of clonal preneoplastic or neoplastic cell populations. Al- 
though the detection of microsalellite alterations does not indi- 
cate the specific genetic change in the rumor, detection of clonal 
cell populations might serve as a cancer screening marker (65). 
Identical alterations have been found in Jung cancers and cor- 
responding sputum samples demonstrating minimal atypra (68). 
'Ihe pib gene is located on the short arm of chromosome 
9(9p21) and is frequently mutated or inactivated in nmrors and 
cell lines derived from lung cancer (09. 70). BcJinsky et ol (71) 
measured hypermcihylatjon of the CpG islands in the sputum of 
lung cancer patients and demonstrated a high coir elation with 
early stages of non-small cell lung cancer, which indicated that, 
pi 6 CpG hypernieihylaiion could be useful in the prediction of 
(unite Jung cancer. However, prospective studies arc needed lo 
evaluate the role of pf6 hypcnneihy lotion as a marker for early 
lung cancer detection. Multiple other genes ate inactivated by 
hypermethybiion in lung cancer (72), and the detection of 
hypcirneihylation may be useful for risk assessnieni and entlv 
diagnosis. 

Compntrr-assisted Image Analysis. Computer- assisted 
image analysis was initially used to detect malignancy associ- 
ated changes [eg., sub visual or noriobvjotis < han^es in the 
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distribution of DNA in the nuclei of histologically normal cells 
in the vicinity of preinvasive or invasive cancer; 73). In a 
retrospective analysis of sputum cytology slides, malignancy- 
associated changes alone correctly identified 74% of the sub- 
jects who later developed squamous cell carcinoma (74). The 
technique has been improved, and recent data showed sensitiv- 
ities of 75% for stage 0/1 lung cancer and 85% for adenocarci- 
nomas with a specificity of 90% (75). This quantitative micros- 
copy technique allows the examining of thousands of cells per 
slide within a relative short time. Similar techniques have been 
approved in the United Stales for cervical cancer screening, and 
might, in the future, play a role for long cancer screening. 
However, no prospective clinical studies has evaluated this 
technology in a larger lung cancer screening setting: 

High Throughput Technology. With future advances in 
jgene chip technology, it might become feasible to probe for 
expression of multiple genes in sputum specimens of asymp- 
tomatic individuals. However, this requires a large amount of 
undegraded RNA from respiratory tract cells. With the high 
throughput technology, a higher sensitivity might be achieved 
by using multiple markers at the cost of achieving a lower 
specificity^ which would be undesirable for a screening study. 

In conclusion, we need to reevaluate the role of sputum 
cytology for screening and early detection of lung cancer be- 
cause of advances in biomarkers and technology. Ongoing stud- 
ies with standard and biomarker analysis in high-risk groups 
might change the previous negative attitude and provide a new 
perspective on sputum cytology as a mass screening tool when 
applied in a high-risk population. Adding different molecular 
diagnostic tests gives the possibility for early diagnosis far in 
advance of clinical presentation. However, validation of the 
tests in larger prospective studies is necessary, and the individ- 
ual tests have to be compared with each other to define the role 
of early diagnosis in ibe overall management of high-risk sub- 
jects. Furthermore, health economic issues have to be consid- 
ered. 

Specimens for Clinical Testing; BAL 

BAL involves (he infusion and rcaspiration of a sterile 
saline solution in distal segments of the hing via a fiberoptic 
bronchoscope. Ahiendt w o/.. (76) examined a series of 50 
resected noo.SCIX tumor patients and compared the rumor and 
BAL with regard to molecular markers including p53 mutations, 
K-ras mutation, the mctbylalion status of the CpG island of the 
p/6 gene, and mic rosatellite alteration (Tables I and 2). With 
the possible exception of the test for microsatellite alteration, a() 
of the tests had relatively high sensitivity and could detect 
mutant cells in the pi esc nee of :i laigc excess of normal celts. 
The frequencies of these chances in the tumors ranged (torn 
27% ffot K-ras mutations) to 5-6% (for p53 mutations). As 
expected, p53 mutations were moic frequent m central (predom- 
i nam ly- squamous cell) tumors, and K-ras mutations were more 
frequent m peripheral (predominantly adenocarcinoma) tumor/,. 
The specificity was high I nearly J 00%) because, with the ex- 
ception of micK»saicl!ite alirianons. the same genet rc change m 
OA I. sample as in rumors was always found, but the sensitivity 
was low, and in only of tumors that contained molecular 
lesions were the sarin* abnormalities detected in corresponding 
BAI. fluids Specifically, the tests wen- least helpful in the 



group of patients in whom improved diagnostic abilities are 
most needed, those with small, peripherally.located tumors (77). 
Unfortunately, the investigators were not able to compare the 
molecular tests with routine cytopathological analysis of the 
BAL specimens. The sensitivity of the molecular tests in BAL 
specimens has to be improved, and we need to know the results 
from subjects at increased risk (current and former smokers 
without rung cancer and survivors of previous cancer of the 
upper respiratory tract) and subjects with chronic lung diseases 
as well as results from healthy never smokers. 

A European group has previously shown that generic al- 
terations delected in DNA from bronchial lavage of individuals 
with lung cancer were also found in individuals with no evi- 
dence of malignant disease (78), which raises the question about 
the specificity of such molecular damage in neoplastic condi- 
. tions. To improve the sensitivity and specificity of detecting 
allelic imbalance in lung tumors, high throughput PCR-based 
microsatellite assays have been established (79). In a recent 
study by Fielding et at. (80), the up- regulation of hnRNP A2/BI 
was found to be a promising marker in BAL for the detection of 
premalignant and malignant bronchial lesions with a diagnostic 
sensitivity of 96% and a specificity of 82%. 

It is too early yet to make conclusions as to whether BAL 
examinations will add to other pathological/molecular biologi- . 
cal clinical studies. To obtain diagnostic material for BAL 
bronchoscopy is required, and we do not have any data that 
compare BAL examinations with biopsies. Thus, we do not 
know whether BAL is a valuable adjunct lo the biopsies taken 
under the same bronchoscopy procedure. 

Specimens for Clinical Testing: Peripheral Blood 

For many years scientists have searched for a lung cancer- 
specific rumoi marker that could be detected in peripheral blood. 
Optimism was raised in the "early" immunocytochemistry era 
by the use of monoclonal antibodies raised against more- or- less 
specific epithelial epitopes. In the search for epithelial cells in 
peripheral blood and bone marrow, monoclonal antibodies 
against cytokeralin have been used. However, these reactions 
are clearly not cancel-specific, and some antibodies have been 
shown to cross- react with norma) blood or bone marrow ele- 
ments (81, 82). Another explanation could be thai cells from the 
macrophage/monocyte system may contain proteins derived 
from the primary tumor .that have undergone . necrosis and 
3poptosis and that ihese processed proteins are recognized by 
the antibodies (82). On the basis of "traditional" immunocyto- 
chemisiry. no markers have been able to delect premalignant or 
car ly- malignant disorders based on a peripheral blood sample. 
Howe vcr. .with the development of DNA technolocrcs. new 
possibilities have been raised, and, with the use of PGR tech- 
niques, some promising reports have been published. 

Nanogram quantities of DNA circulating in blood arc pres- 
ent in healthy individuals (S3. 84). Tumor DNA is also released 
into the plasma component in increased quant iiics (85. f>r>). 
Thus, die plasma and serum of cancer patients is enriched in 
DNA. an' average four times the amount of free DNA as com- 
pared with normal controls (87). In a study by Chen et a I. (SSV 
a comparison of rnicrosateJIite alteialions in rumor and plasma 
DNA was done in SO IX patients, and 93% of the patients with 
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Table J Tissues and other resources for the study of molecular markers 


Specimen 


Ref. 


Comments 


Tumor tissue 


Numerous 
i 


Mixture of ceil types, may require microdissection (139). Extensively used 
for roost. studies. Alcohol- fixed fine-needle aspirates may be used for 
mutational arid other studies. 


Sputum 


65, 68,71, 74 


Respiratory cells usually in small minority. Most samples fixed in 
Saccomanno's fixative. Several studies have been performed on these 
specimens many years later. 


Surrogate organ 


140 


Predominantly, squamous epithelial cells. Buccal smears; brushihgs of 
tongue or tonsil may be explored as potential surrogate organs resulting 
from the field effect of tobacco damage of the entire upper 
aerodigestive tract This concept needs to be confirmed. • 


Bronchial brush/wash 


HI, 142, 143 


Predominantly respiratory cells. Fresh, frozen, or alcohol-fixed samples are 
suitable for multiple studies including FISH." 


Bronchial tissues 


42, 43, 45, 144, 145 


Usually from bronchial biopsies, but may be obtained from surgical 
resection specimens. Formalin fixation and paraffin embedding required 
for histological diagnosis, although EASI preps may permit 
identification and isolation of subpopulations. Paraffin sections may be 
used for genotyping polymorphisms," for alleloryping, and for in situ 
hybridization. 


BAL fluids 


76, 78, 146, 147, J48 


BAL fluids are useful for examining the peripheral airway cells, which are 
the precursor cells of most, adenocarcinomas. Numerous mononuclear 
cells present. Enrichment of epithelial cells desirable. 


Blood components 


72, 92, 149 


Analysis of circulating tumor cells and genetic material shed by dying 
rumor cells into the plasma component may yield useful biological and 
diagnostic information. Gene mutations and presence of epithelial cell 
markers have been used to delect circular ing tumor cells. Gene 
mutations, allelic loss, microsatellite alterations, and aberrant 
methylation have been used to identify tumor cell DNA released into 
the fluid compartment. 


Tissue for molecular staging 


150, 151 


All bough little data exists for lung cancers, regional lymph nodes, seniinel 
lymph nodes, and surgical resection margins have been used in other 
rumor types for molecular staging. 


Tumor coll lines 


152, 153 


Provide an unlimited self-replicating source of high-quality moleculai 
reagenis and have been used for numerous studies. Cell lines may or 
may not reflect ihe properties of ihe tumors from which they were 
derived (26), although they probably icpiesent cellular subpopulations 
(27). Aggressive metastatic tumors are more likely lo be successfully 
cultured (28) resulting in skewed data. 


Cultures of nonnialignant (issues 


154. 155 


Epithelial cultures may be useful for studying molecular changes during 
multistage pathogenesis. Temporary as well as a few immortalized 
cultures from nonmalignant epithelial cells have been established. 
B-lymphoblasioid cultures are useful for linkage analysis, for genetic 
susperiibrliry studies, and for alleloryping corresponding tumors. 


Nonmalignant tissue from patients 
3nd ftnm cancer- bee relatives 


156, 157. 158 


Tissues such as buccal smears, tumor- free lymph nodes, and peripheral 
blood mononuclear cells are useful as connols for linkage analysis, for 
genetic susceptibility studies, and for alleloryping corresponding tumors. 



"FISH, fluoresce nee in siiu hybridization; EASI. rpii he-Hal npgregaic separation and isolation. 



microsatellite alterations in tumor DNA also had modifications 
in the plasma DNA. However, some patients had LOH only in 
the rum ot DNA. Because most of the micH'-satellite alterations 
we ic si mi hi in rumor DNA and plasma DNA. they concluded 
that some of the DNA circulating in the blood comes I torn the 
rumor. Thus, modifications of circulating DNA can be used as 
an early detection marker. Detection of a ben ant DNA methyl 
ation m serum DNA in patients with non-.SCl.C lias been 
reported (12). Although the number of patients w;ts small ami 
the hyper methylated DNA was (ootid in all si a pes. it open:- up 
lot the possibility to be used as an early long cancer detrition 
market Furthermore. 'pi.> and ras gene mutations have been 



detected in the plasma and serum of patients with coloiectal 
cancels (89-91). pancreatic carcinomas (92. 93). and hemato- 
logical malignancies (94). 

In conclusion, the limited direct accessibility of lung car- 
cinomas has led lo ef foils to identify tumor- associated soluble 
matters in senjm or plasma. Many of the cunently recognized 
soluble markers were fit si identified as "tumor" markers but. 
when evaluated in nonneoplastic tissue, have often been found 
in normal celts as well as in runiots. I : oi eaily detection of )»ng 
cancer, we need more clinical data evaluating these new molec- 
ular biological markers from multiple sites, especially in high- 
risk groups. 
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Table 2 Molecular approaches for lung cancer investigation 



Specimen 



Ref 



Comment 



Gene mutations 



Allelotyping 



Gene expression at RNA 
and protein level 



159, 160, 161 



18, 15S 



145, 162, 163 ; 164/165, 166 



Molecular cytogenetics 



Comparative genomic 
hybridization 

Morphomctric studies 



40, 167,. 163. 169, 170 

171, 172 

74, J 73/ 174 



Widely used technique, especially for p53 and ras genes: Often used 
to determine the r6Je of a newly discovered gene in the 
pathogenesis of hing cancer. May be of diagnostic am! prognostic 
significance. Multiple methodologies available. . .. 

Useful as a partial substitute for mutational analysis and for 
determining the chromosomal locations of putative tumor 
suppressor genes. Widely used to study multistage pathogenesis. 
Readily performed on formalin- fixed. and microdissecled tissues. 
Increasing use of genotyping using automatic sequencers. 

Northern blotting and reverse transcriplion-PCR are widely used to 
investigate gene expression. Western blotting often used for. 
detection of protein expression. In situ hybridization for message 
expression can be performed on paraiTro-embedded tissues and, 
thus, can be used to investigate multistage pathogenesis, 
Microarray techniques offer promise of examining all or most of 
the genome but currently require relatively large amount of high-, 
quality RNA from purified cell populations. Sage technique useful 
for investigation and identification of expressed genes. Similarly, 
advances in proteomics will permit simultaneous detection of 
multiple proteins. Numerous immunohistochemical studies of 
. oncogene expression have been used to study multistage 
pathogenesis. Of particular interest, hnRNP expression on 
exfoliated epithelial cells in sputum samples may predict for 
development of cancer. 

In situ hybridization studies of fixed materials or using smears has 
provided considerable information about numerical and structural 
changes. 

Useful for detection of gene amplifications. Less sensitive for the 
detection of regions of allelic loss. 

May be applied to paraffin-embedded tissues. Useful for determining 
aneuploidy and for measuring a number of nuclear and 
cytoplasmic parameters. ' L 



Specimens for Clinical Testing: Bronchoscopy 

WJLB is the roost commonly used diagnostic tool for ob- 
taining a definite histological diagnosis of lung cancer. Bron- 
choscopy has major diagnostic limitations for piemalignant le- 
sions. Because these lesions a»e only a few cells thick (0.2-1 
mm) and have a surface diameter of only a few millimeters, they 
rarely are observed as visual abnormalities. Woolner (95) re- 
ported thai squamous cell CIS was visible to experienced bron- 
choscopisls in only 29% of cases. To address this limitation, 
fluorescence bronchoscopy was developed, Katly studies of 
fluorescence bronchoscopy entailed the use of fluorescent drugs 
(hematoporphyrin dyes) that were preferentially retained in ma- 
lignant tissue (96). Although, studies evaluating this approach 
did. in fact, show that early invasive rmd t.n sun cancers could be 
localized, the detection of dysplasia remained problematic (97- 
100). Furthermore, ihc development of photodynnrnic diagnos- 
tic systems was hampered by problems including skin photo- 
sensitizing and interference with tissue nuiolluorescence. To 
overcome these problems, a new laser photodynamic diagnostic 
system was developed (101). This system deterred rumor- 
specific drug fluorescence at b30 nm wavelength, which is far 
from normal tissue an to fluoresce no- (?U0 -oS0 nm). and iniei- 
ferencc by antolluoiesccnce horn normal tissue should then 
have been eliminated, but it remained a Mgnihcani problem 
(102). 



Another approach was developed by Palcic et al (103), 
who noticed the lack of autofluorescence in the tumor lesions by 
using blue light (442 nm) rather than white light to illuminate 
the bronchial surface. They amplified the difference in autofluo 
rescencc between normal, piemalignant, and turn or tissue for 
clinical use (103, 104). Using a high quality-charge coupled 
device and special algorithm, the LJFE was developed, taking 
advantage of the principle that dysplastic and malignant tissues 
reduce 3Utoftuorcsccnt signals compared with normal tissue 
(Fig. 3). 

Several studies have been perfoirned comparing the diag- 
nostic specificity and sensitivity of LIFE bronchoscopy versus 
WLB in diagnosing preinvasive and early- invasive lesions 
(105-109; Table 3). Most of the studies reported a higher 
diagnostic sensitivity of L1FJ: bronchoscopy in the detection 
premalignant and eat ly- malignant lesions at the cost of lower 
specificity {i.e.. more false- positive results). In most of these 
studies, lesions with moderate dysplasia or worse were the target 
of the study and rated as "positive." The prevalence of prein- 
vasive and early lung cancer varies widely from one study to 
another, bom 20.?% ( 10>) to fO 8% (102). The explanation 
might be beyond the risk profile of genetic variations oi differ- 
ent levels of o.peiience among the endoscopists as well as the 
pathologists involved, f urthermore, there seems to be a training 
effect in using the I. II 1: bronchoscope, which has been demon- 
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Fig. 3 A, normal WLB and normal LIFE bronchoscopy. B, WLB show* inflammatory changes in ihe. bronchial mucosa but 'no suspicion ol 
■malignancy [le/i). LIFE bionchoscopy shows diffuse reduced autofluoicscencc \ visualized by diffuse re J- brownish cohrizaiion; arrows). Biopsy 
demonstrated diffuse severe dysplasia. 



strated by Venmans et al. (107). In their study, the diagnosltc 
sensitivity increased from 67 lo 80% when comparing the first 
and the second half of the study. The use of rhe LIFE device in 
conjunction with WLB improved the detection rate of preneo- 
plastic lesions and CIS significantly (Tabic 3). Kurie etui, f 106) 
looked for more subtle tissue transformation, but their shtdv 
included few patients \*ith moderate dysplasia or worse. No 
improvement in the evaluation of metaplasia inde> was ob- 
served by the use of LIFE bronchoscopy. Thus, differences in 
the study population might explain the different conclusion. 
There aie still no clinical studies with sufficient long-term data 
showing that moderate dysplasia is rhe most ic levant clinical 
predictor o( eventual malignancy. Li mil. it inns in making con- 
clusions from the c>:isi:ng studies rue also the potential meth- 
odological bias related to the ordei in which ihc different bron- 
choscopy procedures ate dene and whether the same examiner 
has per formed both procedures. To address these issues., a 



prospective randomized study between LJFE bronchoscopy and 
WLtt was done 3t the University of Colorado Cancer Center. 
The study design included a iandomi?nlion with regard to the 
order of procedure as well as the order of the individual bron- 
choscopist DO')). The order of the procedure and of the indi- 
vidual bronchoscopes! did not affect the results 'Ihe study also 
demonstraied a significantly higher sensitivity in detecting pre- 
malign;:ni lesions visualized by the LIFE, but at the cost of a 
lower specific i ly (JOV). 'Hie reason for the low diagnostic spec- 
ificity fo^m] with the LIFE bionchoscopy in the d liferent studies 
might be attributable to the .visualization of more abnormal fo( t 
with the LIFE bronchoscope, with ihe consequence thai a largei 
numbei o) biopsies were taken and. thus, ihcic was a hieher nsk 
of rmue false- positive results. The use of LifE bionc hnsropv 
has led to the identification of a new morphological entity, the 
ASD. which is described above. In a recent morphological study 
angiodv; pfastK" change.-; weie treqocntlv loom) m preneoplastic 
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Fig. 4 Seventy-one-year-old man with a spicular nodule in upper left 
lobe demonstrated on low- dose helical CT {picture), but not visible on 
chest X- radiography. CT- guided biopsy showed adenocarcinoma. 



and early-malignant lesions in the bronchi (26). The morpho- 
logical entity has been confirmed in preneoptasias among smok- 
ers, and the perspectives of this finding have been extensively 
discussed (1 10). The prognostic significance of this morpholog- 
ical enlity is currently studied in ongoing long-term follow-up 
studies. Future studies have to evaluate the role of ASD as a 
biomarker for early lesions and whether it can be used as a 
marker for treatment effect or therapeutic target for chemopre- 
v en lion. 

The LIFE bronchoscope may play an important role in the 
screening and follow-up of subjects at high risk of developing 
lung cancer. At this stage, however, it is unknown whether the 
LIFE bronchoscope will lead to a reduction in lung cancer 
mortality. There are also no data on cost-effectiveness and 
cost-benefil analyses available for Ihis new diagnostic proce- 
dure. The use of (he LIFE bronchoscope may also in the future 
be extended to other indications, e g., patients staged as having 
resectable lung cancer on one side. Whether LIFE bronchoscopy 
of the contralateral lung will disclose abnormalities, which 
would change the therapeutic decision, is not yd reported.. 

Recent Advances hi Radiology 

The previous NCI-sponsored screening trials failed to dem- 
onstrate any reduction in the lung cancer mortality by sputum 
cytology and yearly chest radiography as mass screening tools 
for lung cancer screening. Limitations of design and execution 
of the studies, however, have been diseased extensively {8, 
III. 112). An extended follow-up (median. 20.5 years) of the 
Mayo Lung Project was recently published ( I 13) There was 
still no difference in lung cancer mortality between the inter- 
vention aim and the control arm (4.4 rersut 3.9 death? per 1000 
person-veais). Howevei. the median survival for patients with 
resected early- stage disease was 10.0 yeais in the mtcnention 
mm vcimh 5.0 Years in (he usual-can* arm ■-' t> .05) J he lanei 
find incs have raised the question as ro whether some small 
lesions wjih limited clinical relevance mav have born idem i tied 
m the intervention ami. and the question ol Voverdiagnesis" was 
discussed in accompanying editorials (1 



Mass screening for lung cancer has been performed in 
Japan for many years and has been performed in over 500,000 
people in about 80% of the local communities (1 15). Sobue ei 
al (!16) observed that annual clinic-based chest X-ray screen- 
ing for lung cancer in Japan showed reduced hing cancer mor- 
tality by about one- fourth among individuals who. underwent 
screening once a year. In this screening program, the relative 
odds ratio of dying from lung cancer within 12 months was 
0.535 and in the 12-24-monlb period was 0:638 (117). How- 
ever, many studies have focused on the pitfalls in the detection 
of abnormalities by radiography (1 J 8-122). The limit of chest 
radiographic sensitivity for nodule detection is roughly I cm in 
diameter, by which time the tumor has over 10 9 cells and may 
already have violated bronchial epithelium and vascular epithe- 
lium. CT has been shown, to be more effective in the detection 
of peripheral lung lesions compared with plain radiography or 
conventional tomography of the whole lung (123, 124). 

Spiral CT scan is a relatively new technology with the 
ability lo continuously acquire data resulting in a shorter scan- 
ning time, a lower radiation exposure, and improved diagnostic 
accuracy compared with those of plain radiography (125-127). 
Spiral CT allows the whole chest to be imaged in one or two 
breath- holds, reducing motion artifacts and eliminating respira- 
tory misregistration or missing nodules. Although there is 
greater radiation exposure with CT than with chest radiography, 
low-dose techniques (lower mA of 30-50 compared with 200 
for conventional CT) have achieved calculated exposure doses 
that are 17% that of conventional CT and 10 times that of chest 
radiographs. Further reduction »n radiation dose while maintain- 
ing diagnostic accuracy is a topic of current research. Further- 
more, for the baseline screening, low- dose spiral- CT- scan i.v. 
contrast is not administered. Nodules as small as 1-5 mm can be 
shown with modern spiral CT technology (25, 128). The obvi- 
ous advantages with this new technology led some groups in 
Japan and in the United Stales to look to low- dose spiral CT as 
a tool for screening (Refs. 129- 13 1; Tables 4 and 5). 

" In a Japanese report, spiral CT scans and chest radiographs 
were done twice a year in 1369 individuals (129). Peripheral 
lune cancer was detected in 15 (0.3%) of 3457 examinations, 
and, among the 15 lung cancel cases detected, the results of 
chest X-ray were negative in 11(73%), and the tumors were 
detected only by low- dose spiral CT. The detection rates of 
low-dose spiral CT and chest X-ray were 0.43% (15 of 3457 
examinations) and 0.12% (4 of 3457 ex ami nations), tespec- 
tively. Furthermore. 14 (93%) ol the 15 lung cancers were stage 
I disease. The histology showed that II of the 15 lung cancer 
cases were adenocarcinoma, and 4 had squamous cell carci- 
noma. The effective exposure do.se with spiral CT scan in that 
study was calculated to about one- sixth that of conventional CT . 

The ELCAP in New York was designed to determine: in) 
the frequency with which nodules were detected: {h) the fie- 
quency with which detected nodules represent malignant dis- 
ease: and (r) the frequency with which malignant nodules aie 
curable 1 131). In the tlXAP study. 71 hmg cancers weie found 
among 1000 subjects screened. Among the ?7 patients with 
cancer. S5% had stage 1 disease ( Tab if _\( 

Another population- based study on low- dose C I screening 
has been published by Sone ni. il30). using a mobile low- 
dose spiral CT sea unci. The detect ion i ale- was 0.-18?p (/ c, 4-5 
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Tahiti Bronchoscopy versus WLB in diagnosing piemaligriant and early-malignant lesions 



Sensirivit, Specificity Predictive values 



Relative Revive W NPV 



Nn of I.FP+ pSm, LIFE+ speciftcity LIFE+ LIFE+ PPV NPV PPV NPV 

Aulho , WLB LIFE WLB WLB LIFE WLB Lig+WLB WLB WLB LIFE L IFE WLB WLB 

^J/r S % & S « % ™ S Q.S5 ^ o. 87 o.n o.so 

" PPV, positive predictive value; NPV, negative predictive value; NR, not reported. 
b Based on reference pathologist. 
' If invasive carcinoma is included. 

Table 4 Result s from three population-based screening studies with low-dose spiral CT (LDCT) 

Detection rate % 
Predictive — A 8 C mci - 



No. of individuals True False 
Authors ■ studied positive* positive* % value % LDCT X-ray Pack-yr yr 



s" * k ^ 0,2 



- Defined as individuals with "test-positive," in whom further workup gave no suspicion of malignancy. 
* The study also included a group of nonsmokers. 
' Average = 45 (not reported in ihe other studies). 



Table} Histology, stage, and size of primary lung cancer delected by low- dose spiral CT 



No. of cancers/ — : „ - A , . on 

Author No. screened AW Sguam. Other I I) 111 IV Average Range <10 H-20 >2I 



Histology % 



TNM % Si2e ( mm ) 



KandcoW.(j2?) 15/13690.1%) 73 17. 93 7 2 *-I8 

Sone ««/.<! 30) 19/5483(0.3%)- 63 5 32 84 16 17 6-47 4 . M 3 
Hcmchke « u f. (131) 27/1000 (2.7%) 67 .3 30 85 4 1 1 ■ *£ » 



* Adcno, adenocarcinoma; Siiuam. : squamous cell carcinoma; TNM, rumor- node- metastasis. 

cases per 1000 examinations). Surpnsmglv, there was no dif- 40-74, whereas ELCAP screened people at high risk, ages >60. 

fcrence in the detection rote among smokers (0.52%) versus with a tobacco history of al leas. JO pack-years. Thus, as 

nonsmokers (0,16%). The results fiom the three population- expected, the risk of the populate to be screened affects the 

ba^ed studies are summarized in Tables 4 and 5. The conclusion rate of cancer detection. 

from ihe<e ^rud.e< is ihai S5% of the lung cancers detected by Questions remaining to be answered include: (a) what are 

low-dose CT were in staee I. offering improved possibility for the diagnostic sensitrviry and specialty of thts procedure: and 

curative treatment and better prognosis in general. However, the (b) does screening reduce lung cancer mortality? The spiral C 1 

i««e of "fahe-po'tiiive-- 5 scans has to be taken into consideration. has not been as sensitive for small central cancers as .1 is for 

Thus far up to 20% of the participants with nodules on the scan small peripheral cancers (129, 131). Minute nodules of lung 

had no malignancv during the follow-up period. The possibrlity cancer that are near the threshold of.delectab.l.ry may be over- 

that the cancers found represent incidental cancers as in the looked at spiral CT screening (132). A prospective study of the 

Mayo Lnrig Project must also be considered (1)4). The results diagnostic sensitivity of spiral CT has recently shown ihai the 

from these Mudies confirm the expectation that low-dose CT diagnostic sensitivity exceeded the sensitivity of conventional 

increases the detection of small noncainfied nodules and, that CT in previous reports (25). However, there were l.mitatjons m 

lung cancer at an earlier :,nd moie curable stage are detected. the detection of infrapnlmonary nodules smaller than 6 mm and 

The mobile CT sciccnine study bv Sone n ,:/. (130) showed that of pleural lesions. Compared with surgery (thoracotomy w,th 

low-dose CI increased the likelihood vi detection of malignant palpation of deflated lung, resection, and histology), the sens.- 

di<ene 10 >mie, ?< eomp.m-d w.h radu^aphv. The overall rate .. livity of spira) CT was 60% for inirapulmonary nodules of < o 

of mahpnan. disease was lower .r. -he Japanese studies (129. mm and 93% lor nodules of SO mm and was 100% for neo- 

P>0) compared with the TLC.AP Mttdv t K e I 131; detection rates plastic lesions >b mm. Fwriheimore. a marked drffeienee in the 

0 43- 0 4S% ivrw^ 2 7%V This could be because the Japanese sensitivities of two independent observers was knmd for nod 

.todies Greened individuals f'™» 'he ^""fll population ages ules smaller than 6 mm, whereas agreement was much bene, loi 
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6-JO-mm nodules (25). Given these promising preliminary clin- 
ical results, further researches needed to determine the optimal 
technique for spiral CT screening, which includes collimation, 
reconstruction interval, pitch, and viewing methods. Decreasing 
the slice thickness to 3 mm, monitoring the viewing of exami- 
nations, and computer-aided diagnosis have been used to im- 
prove the diagnostic capability of spiral CT in the detection of 
pulmonary nodules (J 33- 136). 

Future large scale randomized studies have to confirm 
whether in fact spiral CT screening will lead to a reduction in 
lung cancer mortality. In a randomized study, the following 
questions arise: (o) what is the optimal high-risk group to study 
and what should be the control arm? (b) what should be the end 
points (goals) of the studies? The ultimate goal is to reduce the 
lung cancer mortality. However, although this is a long-term 
goal, intermediate end points from such studies should be eval- 
uated. The change to more curable stages at diagnosis for ihe 
lung cancer patients is one such immediate goal; (c) what is the 
optimal workup and the morbidity of this program? (d) what is 
the cost, of such a screening program? and (e) what is the 
false-positive rate of the screening findings? Incorporation of 
smoking cessation programs should be included in ihe future 
design of screening studies because it has been shown thai 
screening with low- dose CT in participants who are still smok- 
ing provides substantia! motivation for smoking cessation (137). 

The sludies with spiral CT-scan have demonstrated the 
superior diagnostic ability in the detection of small peripherally 
located tumors, most of the malignant ones of adenocarcinoma 
type of histology. The diagnostic sensitivity of spiral CT for 
more centrally located rumors (mostly squamous cell carci- 
noma) is significantly lower than for the peripherally located 
ones. Through. these spiral CT sludies. we will learn about the 
biology, pathology, and clinical course of these small tumors, 
which might be different from whal we know about clinically 
more evident tumors detected routinely in previous studies. 

Because lung cancer is so common, the introduction of any 
new screening technique in this area has to be underpinned by 
careful definition of the cost implications and must be justified 
by compelling evidence. The cost- effectiveness of the spiral 
CT appioach should be assessed by evaluating the iate of 
over- diagnosing nonmalignani, relatively common abnormali- 
ties and comparing CT imagine to other diagnostic technologies. 

PHT with FOG has lecenily emerged as a practical and 
useful imaging modality in the preoperative staging of patients 
with lung cancer . However, whereas CT is most frequently used 
to provide additional anatomical and morphological information 
nbout lesions, the f-DG PUT imaging provides physiological and 
metabolic in format] on thai characterizes lesions that arc inde- 
lerminale by CT I" DC PI:T imaging lakes advantage of the 
increased accumulation of f'DG in ti a ns formed cells and is 
sensitive i — for the detection of cancer in patients who 

have indeterminate lesions on CT ( 138). The specificity (— 85-%) 
of PET imaeini; is slightly less than its sensitivity because some 
inlhminatcrv pier esses avidly accumulale IDG. The high neg- 
ative pie<1i«'ti\ t- v:i!ui' i>| pj : .T sm'geMs; that lesions considered 
neenuve on ihe sfmh arc benign, biopsy is not needed, and 
radiographic follow -up is recommended Several sludies have 
documented the increased accuracy of PET compared wiih CT 
in the evaluation ol 'he I j tin r :ind mediastinal lymph node s tarns 



in patients with lung cancer (138). However, the PET resolution 
is sufficient only for nodules ^6 cm and will not be helpful in 
detecting the very small nodules. Compared with low-dose 
spiral CT, the FDG PET scan is more expensive and time 
consuming. The role of PET scan in early , diagnosis of jung 
cancer in an asymptomatic high-risk population is not yet eval- 
uated. However, future studies have to include PET evaluation 
to define its role in a population screening setting. 

Conclusion 

Recent advances in molecular biology and pathology have 
led to a better understanding and documentation of morpholog- 
ical changes in the bronchial epithelium before development of 
clinical evident lung carcinomas. Combined with technical de- 
velopments in radiological and bronchoscope techniques, ihese 
procedures offer great promise in diagnosing lung cancer far in 
advance of clinical presentation. Any of these individual proce- 
dures could be incorporated into the routine management of 
individuals at risk for developing primary or secondary lung 
cancer, and for several of these methods, clinical studies are 
under way. Preliminary reported data are very promising for the 
early detection of lung cancer. Future studies must incorporate 
the different methods in a multidisciplinary scientific setting to 
evaluate the role of the individual method in the overall man- 
agement for individuals at high risk for developing lung cancer. 
Several of these tests might diagnose the disease at the stage of 
clonal expansion before invasive carcinoma has developed. A 
management and intervention strategy appropriate to that stage 
of disease have to be developed. Preliminary studies of chemo- 
prcvention agents are reported, and new agents based on other 
biological mechanisms are under development and ready for 
clinical trials. It is now time to plan clinical trials that evaluate 
both diagnostic and therapeutic approaches to access their im- 
pact on the incidence of clinical lung cancer. 
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moderate dysplasia) and 3 were sessile (1 with moderate and 2 
with focal!y severe dysplasia)]. Samples from 16 colorectal ade- 
nocarcinomas (1 Dukes* stage A, 6 Dukes* stage B, 8 Dukes*, stage 
C and 1 Dukes* stage D), 1 villous adenoma and 1 malignant colon 
lymphoma were prospectively collected during 1999-2000. These 
prospectively collected, tissue specimens were dissected so that 
samples contained bom normal mucosa and tumor tissue and were 
obtained within 30 min following Surgical bowel resection. The 
specimens were immediately fixed in 4% neutral buffered formalin 
for 20r24 hr and then paraffin embedded. The 14 archival samples 
had also been formalin fixed and paraffin embedded. * 

Generation of nonoverlapping TIMP-1 cDNA fragments by PCR 
The full length T7MP-1 cDNA (GenBank NMJXB254) cloned 
in pSP64 vector 29 was used as template to generate 2 nonoverlap- 
ping PGR fragments for in vitro transcription, and named f 104 (bp 
56-378) and f!06 Q>p 398-680). F ir st, the whole insert (-780 bp) 
. was cut out by digestion with Hind 111 and tfamW and punned 
after agarose gel electrophoresis using the Qiaex II gel extraction 
kit{Q>agen, Crawley, pnited Kingdom). To generate nonoverlap- 
ping antisense probes and the corresponding sense probe, 2 PCR 
fragments were generated using. Upstream primers flanked by a 
linker sequence containing an Ecoftl restriction enzyme site (un- 
derlined nucleotides) and a 13 polymerase binding sequence 
(boldface) 5 r -(gagaattcattaaccctcactaaagggaga>3', and down- 
stream primers flanked by a linker sequence containing a BamYQ 
restriction enzyme site and a T7 polymerase binding sequence 
5'-(ggatcctaatacgactcactatagggag) : 3' 9. The TIMP-I specific up- 
stream primers were 5'- acccaccatggccccctttg -3' for fl04 and 5'- 
(linker) - gcaggatggactctlgcaca -3' for fl06, and the downstream 
primers were 5 r - linker- actcctcgctgcggttgtgg -3' for f!04 and 5' - 
(linker) - tatctgggaccgcagggact -3 ' for fl 06. PCR using the 2 fl 04 
primers of the 2 f 106 primers was done as previously described. 30 
The PCR products were purified by column chromatography 
using S-200HR microspin columns (Amersham Pharmacia Bio- 
tech, Inc., Piscatway, NX), and their size tested by agarose gel 
electrophoresis. Both migrated as —300 bp fragments in accor- 
dance with the predicted size (322 and 282 bp, respectively). An 
ABI. PRISM 310 genetic analyzer was employed for DNA se- 
quencing analysis and was performed according to the* manufac- 
turer's instructions (Perkin Elmer, Applied Biosystems, Foster. 
City, CA) using the primers specified above. The DNA sequences 
obtained were confirmed by comparison with the specific TDvfP-1 
cDNA nucleic acid sequence (GenBank NM_003254). 

Plasmids containing human MMP-2 cDNA .(PCO17201, bp 647- 
1284) and human MMP-9 cDNA (pCol9202, bp 1751-2326) have 
been described elsewhere. 31 

In vitro transcription 

Antisense and sense riboprobes were labeled with j5 S UTP 
(NEN, Boston, MA) by in vitro iranscriplion using T7 and T3 
RNA polymerases. (Roche, Basel, Switzerland). The DNA tem- 
plate was digested with DNase (Proraega, Madison, WI). Nonin- 
corporaied 53 S UTP and DNA was removed by column chroma- 
tography using S-200HR microspin columns (.Amersham 
Pharmacia Biotech, Inc., Piscaiway, NJ). The 35 S activity was 
adjusted for every probe by dilution to 500,000 cpm/uJ. 

In siiu hybridization 

)n situ hybridization was performed essentially as described 
previously. 32 In brief, 3 u.m paraffin sections were deparaf finized 
in xylene, hydrated with graded ethanol and boiled in a microwave 
oven for 10-12 min in 10 mM citrate buffer, pH 6.0. After 
additional 20 min at loom temperature, (he sections were dehy- 
drated wiih graded ethanol and die 35 S labeled probes (2X 10 6 cpm 
id 20 u.1 hybridization mixture 5 "' per slide) incubated overnight at 
55°C in a humidified chamber. Sections were washed in Mellen- 
dahl chambers with SSC buffers containing 0.1% SDS and 10 mM 
DTr at 150 rpni at 55°C using a Biihlei incubation shaker (Jo- 
hanna Ono GmbH, Hechingen, Germany) for 3 0 min in 2XSSC, 



10 min in OJXSSC, and 10 min in 0.2XSSC Sections were then 
RNase A treated for 10 min to remove nonspecifically bound 
ribpprobe. Subsequent wash* was performed in 0.2 X SSC as spec- 
ified above. Sections were dehydrated and soaked* into an autora- 
diographic emulsion (pford), exposed for 5t7 days if not otherwise 
stated and finally developed. Sections were counterstained with 
: haematoxylin and eosin. 

Jmmunoperoxidase staining 

Invrnuriohistcchernistry was performed essentially as described 
previously.^ 2 Five micrometer paraffin sections were deparafr 
finized with xylene and hydrated through etbanol/water dilufjons. 
Tissue pretreatment was performed with protease-K (5 u-gAnl) 
digestion for 20 min. Sections were blocked for endogenous per- 
oxidase activity by treatment with 1% hydrogen peroxide for 
15 min. The sections were washed in 50 rnM TriS J 50 mM NaQ, 
pH 7.6, containing 0.5% Triton "X-10Q (TBS-T). Incubation with 

-aritibedies^was-done overnight at4°C. Sheep polyclonal antibodies 
(pAb) against TTMP-1 and nonimmune goal IgG were used at a 
final concentration of 4i) ixg/ml. Two monoclonal antibodies 
(MAb) against TIMP-1 , 33 NM4 {clone rT3X6A > NeoMarkersv Fre- 
mont, CA) and CalB2 (clone 147-6D11, CalBiobem, Oncogene 
Res. Products, Cambridge, MA), and a >lAb against trinitro- 
phenyl (TNP)** were all incubated at 1.0 u.g/ml (all 3 MAbs are 
IgGl). CaIB2 MAb recognizes both free TIMP-1 and TXMP-1 
in complex with MMPs. 33 NM4 MAb only recognizes free 
TIMP-1. 33 According to the manufacturer's descriptions, both 
MAbs are raised using recombinant human TIMP-1- Thp- sheep 
polyclonal antibodies were raised by immunization with TIMP-1 
purified from human dermal fibroblasts. The IgG was obtained by 
triple precipitation using ammonium-sulfate and characterized by 
irhmunodifxusion and rocket irmnurjoelectrophore^is. 35 In admV 
tion, we have shown that the pAb recognize both free and MMP- 
coraplexed TIMP-1 36 Furthermore, the specificity of the antibod- 
ies was analyzed by Western blotting analysis against recombinant 
human TIMP-1 expressed in NSO mouse myeloma ^ cells. Here, the 
antibody preparation recognizes a band of approximately 28 kDa 
in accordance with the molecular weight of TIMP-1. To certify 
that the pAb recognize TTMP-1 in colon tumors, the antibodies 
were immobilized on a sepharose column. Total protein extracted 
from 3 colon adenocarcinomas was passed through the column 5 
times and the bound and subsequently eluted protein analyzed in a 
Western blot using a TIMP-1 monoclonal antibody (MAC15). A 

- single band of approximately 28 kDa was revealed in accordance 
with the molecular weight of TIMP-1 (results not shown). In 
immunohistocheraistry, the sheep pAb were detected with biotin- 
ylated rabbit-anti-goat IgG, which cross-react with sheep IgG 
(1:100, code E466, DakoCytomationJ followed by horseradish 
peroxidase in complex with streptavidin (code K377, DakoCyto- 
matjon). The MAbs were detected with the Envision-mouse re- 
agent (EnVision reagent, K4003, DakoCytoroation), followed by 
tyj amine amplification, using biotinyl tyramine substrate as-spec- 
ified by the manufacturer (Nen, Boston, MA). Sections were • 
developed with NoyaRed substrate as specified by the manufac- 
turer (Vector Laboratories, Burlingnarne, CA) for 15 min. Finally, 
sections were counterstained in Mayers haematoxylin, dehydrated 
in ethanol and mounted. 

Combined in siru hybridization apd immunohistochemislry 

Double labeling by combining in situ hybridization and unrou- 
nohistochemisiry on paraffin sections has been described Pi- 
ously. 32 In brief, using MAb against ct-sm-aciin (clone 1 A.) 
diluted 1:1000. against cytokeratin (cloDe AE1/AE3) diJulea 
■1:1000, or against CD68 (clone PGM1) diluted l:?00 r sections 
were incubated for 2 hours at room temperature and then detected 
with ami-mouse-lgG/horse radish peroxidase-conjuijated polymers 
(Envision-mouse reagent, DakoCytomation, Glostnrp, Denmark). 
Sections were developed with diaminobenzidine (DAB) Jor 
7-10 min, and immediately dehydrated for in situ hybridization, 
which was performed as described above using the antisense 
probes of f 104. Seciions were counterstained with hacrnatoxyhn- 
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Results 

Analysis of T1MJ>-} probes and antibodies for in situ 
hybridization and immunohistochemistry 

Histopathologic*! diagnosis of prospectively collected speci- 
mens from 18 colorectal lesions revealed 16 colorectal adenocar- 
crnomas, J v,]l ous adenoma and } malignant lymphoma. *>S- 
labeled anosense and sense RNA probes were generated by m vitro 
'TT^^ >t! T^v*° in ^ nQDOVcr ' a ! J P , > n S DNA sequences of the human. 
TIMP-1 cDNA and tested by in situ hybridization on adjacent 
secuoos from 5 of the colorectal adenocarcinomas. The 2 antisense 
probes showed ah identical hybridization pattern in all the 5 cases, 
located rn the stroma] compartment surrounding the invading 
cancer cells, while no specific signal was seen with the 2 sense 
probe 5 1>. To test whether the TIMP-1 mRNA was accom- 
panied by TIMP-1 protein expression, irnmunohistochemistry was 
performed on 8 of the adenocarcinomas (including the 5 men- 
tioned abo ve ) and the malignant lymp homa usin g sheep anti- 
^^TTTOFrr^oT^ 

11MP-1 m sou hybridized sections. Tht TIMP-1 mRNA and 
immunoreactiyity was observed in the same cells in all of the 9 
*n»f^>i °T 2A\incMins Ae malignant lymphoma. The 
anti-TlMP-1 polyclonal antibodies did not react with other cell 
populations m all of $ adenocarcinomas and the malignant lym- 
phoma apart from some normal and malignant epithelial cells uYat 
were weakly stained on the luminal apical surface. Two MAbs 
agamst TIMP-1 (CalB2 and NM4) required strong signal ampSfi* 
canon but showed a staining pattern similar to that of the poly- • 

^^^o 1 ^!^^ m > *** * e 0n] y ^^Ptionthat 
n ^ e of f 2 r MAI» stained the luminal apical surface of the 
normal and mahgnant epithelium; No signal was obtained with 
n^rnmune goat serum or a MAb (of same subclass as CaJb2 and'- 
NM4) directed against the synthetic hapten trinitrophenyl CTNP). 
TIMP- J mRNA expression patterns in colon cancer 

JT^ 0n ^l^- A ^ A Was *»' remain- 

ing 9- colorectal lesjons by in situ hybridization. TIMP-1 mRNA 



expressron was in all the cases of colon adenocarcinoma (including 
those mentioned above) highly expressed in stroma) fibroblasWikr 
cells located at the invasive front (Kg. ?>a,d). TIMP-1 mRNA 
signal was also observed in fibroblast-like cells located in the 
tumor stroma towards the colonic lumen in 8 of 10 cases where 

^pTVi^^ r S prCSdm (data M sh °W- No <* little 
f^"Af^ A V s dCteCtcd m ^ Part of the carcinomas. 

t»xtT colorectal adenocarcinomas, we observed TIMP 1 

mRNA signal in some fibroblast-like cells looted around the 
rnuscle layer, of some arteries located in the submncosa distant 
from the cancer area. The normal colonic mucosa, includin« the 
lamina propria that was present in all samples tested, was generally 
negative (Fig. 3b,e). Only a^elaTivery weak TIMP-1 mRNA signal 
was detected in stromal fibroblast-like cells surrounding one £ a 
very few normal crypts (Fig. 3c,/) in 3 out of 6 cases tested with 
extended exposure time (10 days vs. usually 5 days). In the villous 

-^Slf^ ° n]y 3 feW I***-* ™™ A J» si ^ cells assT 
- uwleJ witfrferfTdflamnoaiioB (data hot shown). In the malignant 
lymphoma of die colon, TIMP-1 mRNA expressing nbroblasi-like 
cells were, different from the adenocarcinomas, located in a dif- 
fuse pattern throughout the whole tumor. No. TIMP-1 mRNA 
signal was observed in any of the 18 cases in the cancer cells 
smooth muscle cells or vascular cells. - . ■ * 




c ' \Z l' m h y h »d«ation with 2 »onoverlappi»o jmP) 

s^ofic probes in-human colon cancer. Three adjacent from I 

colon adenocarc^noina were mcnbaied with o DonovSiSn ™S 
btcted a^sense probes for TIMP-1 mRNA |f J06 (ab) a ^104 
and a conespondmg TIMP- ) sense pjobe f 06 (c) \ The t Tu hv 

£>e > t ent ol^ in darkfield illumination ib-d). 



Characterization of TIMP- J rnRNA expressing cells 

rJZ ^ T^^lT™ 151 1 mRNA ^ Sitive ^blast-like cells 
could be (myeloblasts and/or macrophages, sections from 4 
colorectal adenocarcinomas and the malignant lymphoma were 
first- iirmiunohistochemicalJy stained with antibodies directed 

^^^" aC ^o r ^ etCCtion ° f Wofibroblast/smooth muscle 
cells. SMC)] or CD6H (for detection of macrophages) and subse- 
quently incubated with a TTMP-J mRNA antisense probe In 
normal colon tossue, a-sm-actin is expressed by vascular smooth 
muscle cells, smooth muscle cells of lamina muscularis mucosae 
and tunica muscularis as well as pericrypud myofibroblasts ^ In 
colon rumors a-sm-actin is expressed by tumor- associated fibro- 
blast-!^ cells located throughput the tumor stroma, which arc 
defined as myofibroblasts. No TIMP-1 mRNA was detected in any 
o>srn-actin positive smooth muscle cells, including those of the 
vessels, the lamina muscularis mucosae and the tunica muscularis. 
In addition, no TIMP-1 mRNA was detected in the a-sm-actin 
positive pencryptal myofibroblasts of the lamina propria in any of . 
the 5 lesions. TTMP-J mRNA signal was. in contrast seen in 
a-sm-acun-positive tumor associated myofibroblasts located at the 
invasive front of the colon cancers. In 3 of the adenocarcinomas, 
more than 80% of TIMP-1 mRNA-positive cells located close to 
JX. CanCCr Cdls weie «-sn>actin-positive (Rg. 4). 
TlMP-i mRNA positive nbroblast-Vuce cells located more distant 
from the invasive cancer cells, towards the submucosa, expressed 
little or no or-sm-actm. In 1 adenocarcinoma and in the malignant 
lymphoma approximately 50% of the TJMPl mRNA positive cells- 
expressed ot-sm-actin. Thus, the TTMP-1 mRNA expressing cells 
constitute a subpopulaiion of tumor-associated my ophiasis lo- 
cated at the invasive front of the rumor. No TTMP-1 mRNA signal 
could be identified in any of the CD68-posibve cejls (Fig. 4). 

Expression of TIMP-I and MMP-2 and 9 in colon cancer 

MMP-2 and MMP-9 are 2 type TV co]I a o enaS es expressed in the 

m^VT* ' iSSUe ° f co]or ^ adenocarcinomas. Thus 
MMF-2 has been reported to be expressed by fibroblastic cells 
in the cancer stroma,'?-' 3n d MMP-9 by macrophages at the 
Jeadmg edge of the invasive cancer.-* To directlv compare die 
expression pntterns of .MMP-2 and MlvJP-9 with that of TIMP- 1 
adjaceni sections from 5 coloreciat adenocarcinomas were hybrid- 
ized with probes for TIMP-], MMP-2 and MMP-9 niRNAs We 
Jound that the expression of TIMP-1 mRNA was located cliar- 
actenshcnUy nt the invasive from of the o JO win 5 tumor, whereas 
the expression of MMP-2 mRNA was most intense in the central 
aieas, snowing decreased expression towards the invasive front 
(Hg. do). MMP-9 mRNA expressing cells were" found nt the 
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Figure 2 -Jn situ hybridization and irronunohistochemistry for TIMP- J in human colon cancer. {/*) Twoadjacent sections from a human colon 
adenocarcinoma were incubated with polyclonal antibodies against HMP-1 (a f c) and a T1MP-1 mRNA anttsense probe (fc). '™ J^"* 
immnnoreacrt viry (red-brown color, arrows in a and c) and the TIMP- 1 mRNA (si lver grains, arrows in b) are identified m the sarne celts J^ ow » 
in a b) Iminunoperoxidase staining with the" TIMP- 1 pAb' reveals the TIMP- 1 -positive cells as fibroblast-like cells (arrows in <;) iocateo m mc 
stroma (St). No TIMP- J immunoreactivity is seen in cancer cells (Ca). a,b: bars * 50 u.m; c: bars -13 urn. (ZT) V&Ahl 
sections were incubated with CalB2 MAb ami- TIMP- 1 (a)/ NM4 MAb (b\ sheep anti TIMP- J pAb (c) or mouse airti TNP U0- The JMAW 
were detected with Envision reaeem followed by TS amplification and the sheep pAb with biotinylated rabbit anti-goat toiioweo oy 
HRP-conjugated sueptavidin (see Material and methods). The 3 TIMP- 1 antibodies react with the same cells (arrows). No irnmunoieacuvny 
seen when the sections are incubated with anti-TNP. 



invasive front like those expressing T1MP-1 mRNA but with a 
distinctly different distribution. Foci with high expression of 
T1MP-1 mRNA were not accompanied with increased expression 
of MMP-9 mRNA and vice-versa (Fig. 5b). Thus, TIMP- 1 mRNA 
expression is not coregu toted with MMP-2 or MMP-9 mRNA 
expression. 

T1MP-J in adenomas and Dukes' stage A carcinomas 

TTMP-J antigen can readily, be measured in blood and we have 
previously reported ihat levels of TIMP- 1 in blood ore significantly 



elevated in colorectal cancer patients compared to healthy donors 
and that high plasma TIMP- 1 levels are associated with short 
suivival of colorectal cancer patients. 3 '- 6 1TMP-1 has therefore 
been suggested to be a novel marker for detection of early stage 
colorectal cancer and for prognostic stratification pf colorectal 
cancer patients.-* 39 These findings, together with the characteristic 
expression pattern of TIMP- 1 afthe invasive front of virtually all 
'the colon cnnceis and die absence or rainuie TIMP- 1 expression m 
norma) and benign colon mucosa, prompted the evaluation of 
TLMP-] expression as a marker for early invasive colon cancer. 
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TTMPO !i^f^ S,UVC maCT °i*^« (red arrows i„ fc). Note Aa. some of ,h^ 7 ^ A ^'"^ Ce " s arrows in *) » 

MW J mKNA signal (red arrows in a). Bars = J3 urn. ° ■ ,he a - 5 ">-='cl,n-pos. ll ve myofibroblasts have little or no 



J^IZT'T C °, mpared T,MP " 1 mRNA «pre»k».j„ a, addi- 
? de "~« PO'VPS w,,h L expressio in an 
mRNA ^ ^ , S ' a?e A ^'o^' 3 ' ^noc^cmomas. TIMP- . 
mRNA s £nsl was delected in 2 of the 6 adenomas, whereas aj) 8 

££ve Zur,?%7? Sh ° Wed T,MP -' mRNA * *e- 

V MP- 1 SnT • POS '" Ve ade " 0ma <P edu ™.Ja.ed t,peX 

fiK ki expression was confined to a siivje focu? in 



showed ST' 9D0keS S ' 3 ° e * carcinomas analyzed 

LlvJv r ex P' c "'°" >" •nyofibroWosis located a. 

on v I nf ™ ° f ihC m " 10,S ' Wbe «« «F«sion was detected 
celk -,<<L ■ ad enonias, and ,» these was seen in fibroblasl-Jike 
.W^n 9 f k 01 ,nflam ™ li °" « *e epitbelial-stromal 
ime, face ro 2 of the cases and with arteries in the snbmucosa in I 



Discussion 

Ioc^-f.-« dy 7°f m !* n j* en '° c,ari 'y expression and cellular 
noclc- MM , P " 1,1,b " or TJMP-I b human colon ade- 

aniisen* RNA probes derived from 2 oonoverlappm" T3MP-1 
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Figure 5 - In situ hybridization for TIMP- ] , MMPr2 and MMP-9 in 
human colon cancer. {A) Adjacent sections were incubated with probes 
specific for TTMP-1 mRNA (a,c) and MMP-2 mRNA (b,J)> respec- 
tively, and is shown in brigbtfield (a, b) and darkfieJd illumination 
(erf). The TEMP-] mRNA signal increases towards the submucosa 
(Sm) whereas the MMP-2 ruRNA signal decreases and is most intense 
in the central areas <Ca). (B) Adjacent sections were incubated with 
probes specific for T1MP-1 mRNA (a,c) and MMP-9 mRNA (b t d) r 
respectively, and is here shown in bright field (a,b) and darkfield 
illumination (c.d). Both the T1MP-1 mRNA signal and the MMP-9 
mRNA signal are most intense at the invasive front towards the 
submucosa (iSm), but their expression patterns are quite different, wiih 
MMP-9 showing the most restricted expression. Bais = 100 u.m. 



cDNA fragments and specific pAb and MAbs against human 
TJLMP-1. The T1MP-) mRNA signal in all colorectal adenocarci- 
nomas investigated was seen in fibroblast -I ike cells located in the 
tumor periphery. An identical hybridization partem was observed 
with the 2 ami sense TIMP- J probes and application of comple- 
mentary sense probes on neighboring tissue sections as negative 
controls did not result -in any hybridization signal. , therefore, we 
conclude that the hybridization signal generated with the nntisense 
probes represents the genuine TIMP- J mRNA. TIMP- 1 imrmino- 
icnciivity was also distinctly located in fibroblasi-Iike stromal cells 
in_the_ tumor periphery, and these cells were identified to be the 
same cells as the TIMP- 1 mRNA expressing cells. A p/eparation 



of sheep pAb against human TTMPrT 35 and 2 well-characterized 
MAbs stained the very same cells in tbe tumor stroma.. Weak 
staining of the apical surface of some normal and- malignant 
epithelial cells was observed with the pAb in some t>f the samples; . 
No staining was obtained when roe-anti-TIMP-l antibodies were 
substituted with nonimmune goat scrum or anti-TNP MAb incW 
bated at the saine concentrations. These immunohistCK^eniicaJ . 
findings strongly suggest that the- TTMP-1 antigen detected in the 
fibroblast-like cells represents the genuine TTMP-1 protein. 

Id our study, we found TTMP-1 mRNA expression in stromal 
fibroblast-like cells located in the tumor periphery in all colorectal 
adenocarcinomas tested, whereas no expression was detected in. 
the. cancer cells jn.any-of the. cases tested.' Hiis finding is in 
agreement with studies by Zeng and colleagues, 12 ' 20 but is partly m 
disagreement with findings by Newell and colleagues. 27 In addi- 
tion to TTMP-1 mRNA signal in fibroblast-like cells in the tumor . 
peripher y, Newell a nd colleagues 27 detected a weak TTMP-1 
rnKNA signal in both benign and malignalt epithelud cells. 27 This 
observation "was, however, based on tihe use' of probes: from a 
. single TTMP-1 cDNA subclone and no additional controls; to verify 
the expression pattern. The difference between* out results and 
those of Newell and colleagues may be explained by methodolog- 
ical differences, since the procedure employed, by Newell and 
colleagues was in several steps different from the one used in the 
present study, e.g., Newell and colleagues used 3 p-labeled probes, 
whereas we used 35 s ^labeled probes. It is in this context notewor- 
thy that in order to look for a low expression level of TTMP-1 
mRNA, we performed in situ hybridization experiments with pro- 
longed exposure time (10 days vs. usually 5 days) with both our 
TTMP-1 antisense probes and both TTMP-1 sense probes, but with 
this challenge we did not detect any TTMP-1 mRNA in any 
epithelial cells. It cannot be excluded though that the TTMP-1 
mRNA is expressed in epithelial cells below the detection limit of 
our in situ hybridization procedure. 

An interesting observation in our study was the characteristic 
intense TTMP-1 mRNA and protein expression in the tumor pe- 
riphery of all colon adenocarcinomas, while little or no expression 
was seen in the center of the carcinomas. Only in the colorectal 
lymphoma did we find TTMP-1 mRNA and protein expression in 
'fibroblast-like cells located throughont the tumor tissue. The 
TTMP-1 expression pattern in the colon adenocarcinomas is in 
contrast to the expression pattern reported by Hewitt and col- 
leagues, 19 who found that the TIMP-T staining in most colorectal 
adenocarcinomas was equally intense in fibroblasts throughout the 
tumors and that some of the cases even showed decreased TTMP-1 
signal intensity towards the tumor periphery. This difference may 
be explained by possible cross-reactivity of the polyclonal anti- 
bodies employed by Hewitt and colleagues or that Hewitt and 
colleagues employed cryostat sections, while we analyzed paraffin 
sections. . 

The TTMP-1 expressing cells had a fibroblast-like morphology 
and using combined in situ hybridization for TTMP-1 mRNA and 
inmiunohistochemistry for a-sm-actin T we found that many, gen- 
erally more than 50%, of the T1MP-1 mRNA positive cells coex- 
pressed a-sm-aclin. According to the cellular morphology of the 
TIMP-T expressing cells and their localization in the invasive 
front, we could conclude that the cells were myofibroblasts and not 
smooth muscle cells. 

The myofibroblast is a cell type present in the normal colon 
mucosa, that originally was described as a pexicryptal fibro- 
blast 37 ^ and later was identified wiOi antibodies agamst a-sm- 
actin. ?7 In the lamina propria, the myofibroblasts form a continu- 
ous cell layer just below the intestinal epithelium. The pencryptal 
myofibroblasts are phenotypically different from ihe neighboring 
quiescent interstitial fibroblasts that do not express markers ol 
smooth muscle cells. 4 * During early steps of colonic: tumongenesis 
the number of myofibroblasts is significantly increased. 4 * The 
TIMP- 3 expressing myofibroblasts may be generated after activa- 
tion of the pencryp 131 myofibroblasts and/or the quiescent inter- 
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K l in colon adenoma and Dukes* stage A adenocarcin~ 




Figure 6 -In situ hybridization for TIM 
sessile adenoma of the colon (o-c) and a Dnkr** A ^^t, (A WV~ ITu - ' * ~7~". v r.*XXl J,, i! i,fl ~ uum a mocerate dyspJashc 



sbtia) fibroblasts. Adegboye°a and colleagues" hypothesized that 
the-rumor-assooated myofibroblasts originate from the quiescent 
interstitial fibroblasts of the lamina propria, rather than from peri- 
cryptal myofibroblast or smooth muscle cells, which may help to 
explain why we found some of the TIMP-1 mRNA expressing 
fibroblast-iiXe cells a-sm-actin-positive and some cc,sm-actin-neo! 
ative. , 0 

Several MMPs including MMP-2, MMP-ll and MMP- 14 are 
expressed by fibroblast-like cells in human colon cancer aw*-* 
some of which may indeed be myofibroblasts. The role of the (mvo-) 
fibroblasts m colon cancer progression is not known. Since TTMP-'l in 
human colon cancer appears only to be expressed by fibroblast- like 
celts most of which are myofibroblasts, and since high TIMP-] levels 
measured in Wood or rumor extracts from colon cancer patients are 
strongly associated with a poor prognosis*^* it could be argued 
that the TIMP-1 expressing myofibroblasts- play a nimor-promobng 
role^ lrninunomstochemical localization studies of proteins involved 
in the activation and regulation of the efficient serine protease plas- 
minogen, including ujokinase plasminogen activator (uPA) and its 
specific inhibitor PA1-] show thai both are mainly expressed by 
myofibroblasts m human beast cancer.*-* Hi £h levels of uPA and 
PAJ-l^are strongly correlated with poor prognosis in breast can- 
cel, supporting the assumption- that the myofibroblast express a 
promoting role m cancer invasion. We recently reported that the 
predominant PA1- i expressing cell in human colorectal cancer Jso is 

r^T, f ir 3nd CarIier Studies ,ndica,ed *a« elevated levels 
of PAJ- 1 m colon concer patients are associated with poor progno- 
sis- Together these findings indicate that myofibroblasts are strong 
contnbmmg , 0 the expression of proteins involved in the re 2 ..bt,W 
extraccllular matrix degrading proteases that facilitate cancer invasion 
and metastasis. 

A P^nicidnrly interesting finding of the present study was the 
absence of TXMP-, mRNA in 4 of 7 adenomas, whereas >n all of 
11 k." A r carcinoma * ,he TIMP-1 niRNA was expressed m 
fibrob as.- like celts along the invasive from. In the 2 beni " n lesions 
in which -be TIMP-1 mRNA was seen in the ndenonufnre X 



TIMP-1 mRNA positive cells were confined to a single focus with 
locally increased inflammation related to the dysplastic epithelium. 
Evident disruption of the dysplastic epithelium was observed in the 
adenoma with most intense TIMP-1 mRNA signal. Intestinal in- 
flammation may be caused by disruption of the mucous epithelium 
that leads to focal leakage of mucinous colon material into the 
lamina propria. Increased intestinal permeability is a common 
deficiency in Crohn's disease and interestingly TIMP-1 mRNA 
was found in the intestinal granulation tissue of Crohn's disease 50 
and is expressed by myofibroblasts isolated from Crohn's dis- 
ease.*' Induction of TIMP-1 in myofibroblasts in a benisn or 
preinvasive tumor may also be a response to locally increased 
MMP activity or a response to the presence of a specific MMP in 
the local microenvironment. MMP-2 and MMP-9 mRNA expres- 
sion, however, did not appear to be cpregulated with TIMP-1 
mRNA expression in the colorectal adenocarcinomas. Specific 
MMPs may indeed be involved in the transition of noninvasive to 
invasive disease; in studies of preinvasive lesions (ductal carcino- 
mas m situ) of the human breast we recently reported that MMP-13 
is specifically expressed in myofibroblasts associated with micro- 
invasive events.^ Future studies may clarify whether TIMP-1 
expression in colorectal adenomas is correlated with expression of 
specific MMPs, cytokines and/or growth factors, such as TCF-pl 
and TGF-P25J, and whether TIMP- Lean be used as a histopatho- 
logic al marker for malignancy in colorectal tumors. 
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^ZSmSST activa,or induccd by dcngue vi ™ tafecti<,n of *■-*» 
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Dengue hemorrhagic fever and dengue shock syndrome (DHF/DSS) are severe 

ST 71 T ° f den ^ e Vi ™ S ^ infeCti0n - However, the pathogenesis of nemorrha^ 
ST"™ 5 inf ^n is poorly understood. Sine! > endlelia c.lSavt g 
pivotal role m the regulation of hemostasia we studied the effect of DV infection on A, 

Sis aid a hnl ? P T" y f ^ end0thdiaI ^ human veins 
Sit avascular endothelial cell line. DV infection significantly Educed 

^hT? 1 ?" f tPA bUt n0t PAI " 1 ° f human ^dothelial cells; In additS 
^dothehalce Is was induced by DV as demonstrated by RT-PCR. SdyaS^ 
but not control antibody inhibited DV-induoed tP A production of endoS ST 

^SS^^T^^ ^ ,6Vels * f ™ - d tPA ^ foundinDHF 
out not Dr patients. These results suggest that IL-6 can regulate DV-induced tPA 

production of endothelial cells, which may play important roles in me ZZZt 

development of DHF/DSS. Copyright 2003 Wiley-Liss, Inc. * " pathogen,c 
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Real-time quantitative RT-PCRof cyclin Dl mRNA in mantle cell 
lymphoma: comparison witU FISH and immunohistochemistty. 

HuiP, HowcJG , Crouch J, Nimmakavatu M . Qumsiveh MB TalliniG PI van SD 
Smith BR . ' ' 

Department of Laboratory Medicine, Yale University School of Medicine 333 Cedar 
Street, P.O. Box 208035, New Haven, CT 06520-8035, USA. ' 

Presence of the balanced translocation t(l I ; I4)(ql 3;q32) and the consequent 
overexpression of cyclin Dl found in mantle cell lymphoma (MCL) has been shown to be 
of [rnportant diagnostic value. Although many molecular and immunohistochemical 
approaches have been applied to analyze cyclin D 1 status, correlative studies to compare 
different methods for the diagnosis of MCL are lacking. In this study, we examined 39 
archived paraffin specimens from patients diagnosed with a variety of 
lymphoproliferative diseases including nine cases meeting morphologic and 
immunophenotypic criteria for MCL by (1) real-time quantitative RT-PCR to evaluate 
cychn Dl mRNA expression; (2) dual fluorescence in situ hybridization (FISH) to 
evaluate the t(1 1 ; 14) translocation in interphase nuclei; and (3) tissue array 
immunohistochemistry to evaluate the cyclin Dl protein level. Among the nine cases of 
possible MCL, seven cases showed overexpression of cyclin Dl mRNA (cyclin Dl 
positive MCL) and two cases showed no cyclin D 1 mRNA increase (cyclin D 1 negative 
"MCL-hke"). In six of seven cyclin Dl positive cases; the t(l 1;14) translocation was 
demonstrated by FISH analysis; in one case FISH was unsuccessful. Six of the seven 
cyclin Dl mRNA overexpressing cases showed increased cyclin Dl protein on tissue 
array immunohistochemistry; one was technically suboptimal. Among the two cyclin Dl 
negative MCL-hke cases, FISH confirmed the absence of the t(l 1;14) translocation in 
both cases. All other lymphoproliferative diseases studied were found to have low or no 
cychn Dl mRNA expression and were easily distinguishable from the cyclin Dl 
overexpressing MCLs by all three techniques. In addition, to confirming the need to 
assess cyclin Dl status, as well as, morphology and immunophenotyping to establish the 
diagnosis of MCL, this study demonstrates good correlation and comparability between 
measure of cyclin Dl mRNA, the 1 1 ;14 translocation and cyclin Dl protein. 
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Elevation of topoisomerase I messenger RNA, protein, and catalytic 
act.v.tym human tumors: demonstration of tumor-type specificity and 
implications for cancer chemotherapy. ^ 

Hasaml MohlerJL, SejgjerHF, Dcsterman .flVT 

SoSSiSol 01 ^ 0,8X0 * ^ ^te, Research Triangle Fade, . 
Topoisomerase I has been identified as an intracellular target of camototherin * nl»„t 

cells to th,s drug is directly related to the topoisomerase I content, In humans the levels 
of topoisomerase I have been shown to be elevated in colorectal rumors cTmpare^ to 
normal colon mucosa. The aim of our study was to determine whether (auZSomLe 

r^f> ' ? (C) thC mCTease m topoisomerase I levels in colorectal tumors is a 
result of mcreased transcription or translation. Topoisomerase I levels were qSSSi in 
crude extracts from colorectal, prostate, and kidney tumors and flTmSt!? fl 
eount^parts by Western blotting and by direct JLa^^^^Z^ 

rumors snowed 5-35-fold increases m topoisomerase I levels, compared to their normal 
colon mucosa In the case of prostate tumors, the increase was 2-lS-foW clTareTwl 
berugn hyperplastic prostate tissue from the same patients. However n ^'dSncTwaf 

SrSf ^ f™* ° ^ topoisomerase I was determined by a quantitative 

32P-transfer assay in crude homogenates, without isolating nuclei Colorectal aM 
prostate tamors exhibited 1 1-40- and 4-26-fold increases, respect very Z c^llc 
actrvity. However, kidney tumors did not show any alteration^ cS^SSST 
compared to their normal matched samples. Thus, for all three tumor ryLesTh^as a 
good correlate between enzyme levels and catalytic activity. FinaUy^To eSalHors 
foTdTntf tor steady state mRNA level, A 2-33-fpld inlase *££a£^ 
found m colorectal tumors, compared to normal colon mucosa. These results suISat 
d erauons ,n topoisomerase I expression in humans are tumor type spec f7anS the 
ncrease in topo.somerase I levels results from either increased transcription of me 
topoisomerase I gene or increased mRNA stability. 
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Immunohistochemistry and quanta fve RT-TO T e ° he ™ teome of patients. METHODS: 
tn 43 archival specimens ci^J^l^^^V^ «* session of pKi-67 
were not treated with neo-adjuvant dw^R^S^?"^ who 
labeling index of 3 1.3% (range 10 3 66 WyTa We „ determ 'ned a median pKi-67 (MIB-1) 

™*<0.0I^9); indict 

associated with a significantly favorable t^^^i^-J^ Va,UeS Were 
correlated to prognostic outcome A miil/L^iT i P r ^ 7 ,abe,,n g were not 

indicated that tumor C^tS^^^S^ 13 ° f ^ 3nd fa <*><* . 

prognostic factors. CONCLUSION Were inde P^nt 
newprognostic indicator for prima^^eTc^ 
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Modulation of glucagon receptor expression and response in transfected 
human embryonic kidney cells. 

Ikcganii T, Cypcss AM Bouscarel B 

n^?o°f M t di ^ Ge ° rge W ^" ngt0n diversity Medical Center, Washington 
District of Columbia 20037, USA. ' 

The modulation of glucagon receptor (GR) expression and biological response was 
mvestigated m human embryonic kidney cell (HEK-293) clones permanently express.ng 

n n 1 T S?" 1116 ° R ^ CXpreSSi ° n ,evd in these doneswas 6 
upregu ated by cellular cAMP accumulation and presented a good correlation with both 

&e protein expressun level and the maximum number of glucagon binding sites 

However the determination of glucagon-induced cAMP accumulation in these cell lines 

revealed that the enhancement of receptor expression did not lead to a proportbna 

IndSbv N^ nTtT U ^^«*di*><* -e maximum cAMp" production 
induced by NaF and forskohn was not significantly different among selected clones 
regardless of the receptor expression level. High receptor-expressing clones showed' the 
greatest susceptibility for agonist-induced desensit.zation compared with clones with 
ower GR expression levels. The results of the. present study suggest that the GR can 
recruit non-GR-specific desensitization mechanism(s). Furthermore, the partial inhibition 
or alteration of the overall cAMP synthesis pathway at the receptor level may beT 
ex^eSo'nt^ *" * ^ 3 massive increa * e * — brane receptor 

PMID: 1 1546678 [PubMed - indexed for MEDLINE] 
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Developmental regulation of acidic fibroblast growth factor (aFGF) 
expression in bovine retina. 

Unite de Recherches Gerontologiques, U. 1 18 INSERT, Paris, France. 

vSSS?! 35 -^ ^^^ i^^^gmoleculeirnplicatedinawide 
vanety of biologic*! processes such as cell growth, differentiation and survival. It has 
been punned from bovine retinal The present study was carried out to detect which cells 
m the bovine retina expressed aFGF at the different stages of embryonic and post-natal 
u^a^j? 1 * 1 *^ 1 specific aFGF mRNA and protein were detectedby in situ 
ST? r ^P^e nboprobes and immunocytochemistry using affinity purified 
polyclonal human recombinant aFGF antibodies respectively. No signal was detected by 
either techmque until 4-5 months and then there was progressive expression of aFGF 

HSl£Z °5 the rCtina - By 8 - 9 months of e^ryonic development, 
,nucle,ofthe3 neuronal layers (ganglion cell layer, inner and outer nuclear layens) were 

rlrSr l Tfu b ,a J elCd ; A S , Hght labding ° f fte P^ 6 "^ epitheiium of the 
retina was also visible throughout development and maturation. These results showed a 
good correlation between message and protein expression in these cell types. In contrast 
glial cells m the nerve fiber layer and vascular endothelial cells displayed a nuclear 
immunostairung for the prdtein in the absence of message. These.data suggest that aFGF 
plays a role m the late steps of retinal differentiation by autocrine and paracrine 
mechanisms. 
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The p21(Cipl) protein, a cyclin inhibitor, regulates the levels and the 
intracellular localization of CDC25A in mice regenerating livers. 

iaimeM, Pujol MJ , ScrratosaJ, Pantoia C . Canela N. Casanovas O . Serran o M 
AgcH N. Backs O . ' 

Department of Cell Biology and Pathology, Faculty of Medicine, Institut d'Investigacions 
Bumediques August Pi Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain. 

Liver cells from p2l(Cipl-/-) mice subjected to partial hepatectomy (PH) progress into 
DNA synthesis faster than those from wild-type mice. These cells also show a premature 
mduchon of cyclin E/cyclin-dependent kinase (CDK) 2 activity. We studied the 
mechanisms whereby cells lacking P 21 (Cipl) showed a premature induction of this 
activity. Whereas the levels of CDK2, cyclin E, and P 27(Kipl) were similar in both wild- 
type and p2l(Cipl-/-) mice, those of the activator CDC25A were much higher in 
p21(CipW-) quiescent and regenerating livers than in wild-type animals. Moreover, 
p21(Cipl-/-) cells also showed a premature translocation of CDC25A from cytoplasm 
into the nucleus. The ectopic expression of p21(Cipl) into mice embryo fibroblasts from 
p21(Gipl-/-) mice decreased the levels of CDC25A and delayed its nuclear translocation 
-The levels of CDC25A messenger RNA in p2l(Cipl-/-) cells were higher than in wild- 
type cells, suggesting that this increase might be responsible, at least in part, for the high 
levels of CDC25A protein in these cells. Thus, the results reported here indicate that 
p2l(Cip 1) regulates the levels and the intracellular localization of CDC25A! We also 
found a good correlation between CDC25A nuclear translocation and cyclin E/CDK2 
activation. In conclusion, premature translocation of CDC25A to the nucleus might be 
involved in the advanced induction of cyclin E/CDK2 activity and DNA replication in 
cells from animals lacking p21(Cipl). 
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Alteration of frizzled expression in reual cell carcinoma. 
Janssens N, AndriesL, JanjcotM, PereraT, jgakkcrA. 

Department of Biochemistry, University of Antwerp, Wilrijk Belgium 
njansse9@prdbe.jnj.com 

To evaluate the involvement of frizzled receptors (Fzds) in oncogenesis, we investigated 
mRNA expression levels of several human Fzds in more than 30 different human tumor 
samples and then corresponding (matched) normal tissue samples, using real-time 
quan^ttvePCR. Weobserved that me mRNA level of FzdS L^r4l7in"Led in 
8 at 1 1 renal carcinoma samples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
carcinoma samples. Western blot analysis of crude membrane fractions revealed that 

S ta B SfT 8 ^™ matChed tumor/noiroal samples correlated with the 
observed mRNA level. Wnt/betaKatenin signaling pathway activation was confirmed by 
the increased expression of a set of target genes. Using a kidney tumor tissue array FzdS 
protein expression was investigated in a broader panel of kidney tumor samples FzdS 
ZT^T* "T 8 7 3S **? Bd m 30% ° f dear cel1 Cinemas, and there was* strong 
exorll o " Z T ™ ^ ^ staini "S in the samples. Our data suggested that altered 
expression of certam members of the Fzd family, and their downstream targets, could 
provide alternative mechanisms leading to activation of the Wnt signaling pathway in 
renal carcmogenesui. Fzd family members may have a role as a biomarker 

PM1D: 15557753 [PubMed - indexed for MEDLINE] 
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Alteration of Frizzled Expression in 
Renal Cell Carcinoma 
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Abstract 

To evaluate the involvement of fr»2zled receptors (Fzds) 
in oncogenesis, we investigated mRNA expression lev- 
els of several human F2ds in more than 30 differ- 
ent human tumor samples and their corresponding 
(matched) normal tissue samples, using real-time quan- 
titative PCR. We observed that the mRNA level of Fzd5 
was markedly increased in 8 of 11 renal carcinoma sam- 
ples whilst Fzd8 mRNA was increased in 7 of 11 renal 
carcinoma samples. Western blot analysis of crude 
membrane fractions revealed that F2d5 protein expres- 
sion in the matched tumor/normal kidney samples corre- 
lated with the observed mRNA level. Wnt/p-catenin sig- 
naling pathway activation was confirmed by the in- 
creased expression of a set of target genes. Using a kid- 
ney tumor tissue array, F?d5 protein expression was 
investigated in a broader panel of kidney tumor samples. 
F2d5 membrane staining was detected in 30% of clear 
cell carcinomas, and there was a strong correlation with 
nuclear cyclin Dl staining in the samples. Our data sug- 
gested ihat altered expression of certain members of the 
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Fzd family, and their downstream targets, could provide 
alternative mechanisms leading to activation of the Wnt 
signaling pathway in renal carcinogenesis. Fzd family 
members may have a role as a bibmarker. 

: ~~ ■ Copyright © 2004 S^Kiroer AG. B*s*l 



Introduction 

The Wnt signaling pathway is evolutionary conserved 
and controls many events during embryonic develop- 
ment. Members of the Wnt gene family of secreted glyco- 
proteins are involved in embryonic induction, generation 
of cell polarity, cell proliferation and the determination of 
cell Tale [ J, 2 J. Recently, it has become evident that the 
Wnt pathway is also deregulated in a range of tumors [3]. 

The Wnt signaling pathway is activated when Wnl pro- 
teins bind to a cell surface receptor complex consisting of 
a member of the frizzled receptor (Fzd) famiry and either 
low-density-lipoprotein receptor-related protein (LRP)5 
or LRP6 [4, 5], A detajJed characterization of the Fzds. 
and the immediate downstream events after Wnl binding 
has been hampered by the lack of pure biologically active 
Writs. 

Downstream of the receptor complex, ihree pathways 
may be initiated, depending on the composition of the 
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ligand and receptor complex The "Wnt/0-catenin path- 
way*, lhe *Wnr/Ca 2+ pathway* or the "Wnt polarity path- 
way* {6]. The Writ/p-catcnin pathway has been linked to 
carcinogenesis. Genetic alterations in components of this 
pathway (adenomatous polyposis coli, APC, axin and 
catenin) can result in the accumulation of non-phosphory- 
Jated P-catenin [3, 7] and this can promote carcinogenesis. 
Conversely, neither the Wnt/Ca 3+ pathway nor the Wnt 
polarity pathway involves the activation of 0-catenjn [for 
review, see ref. 1,6). 

Mutations in one of the three regulatory genes (APC, 
P-catenin and axih), overexpjession of Wnts and Fzds or 
the expression of a const it u lively active Fzd have been 
. linked to Wnt/P-catenin pathway activation in various 
tumors [8, 9]. 

To evaluate the involvement of Fzds in oncogenesis, 
we investigated mRNA expression levels of several hu- 
man F2ds (Fzd2 f 3, 5, 6, 7, 8 and 9) in more than 30 differ 
ent human tumor samples using real-time quantitative 
PCR. Each sample was compared with its corresponding 
(matched) normal tissue sample. The most striking obser- 
vation was the dramatically increased Fzd5 and FzdS 
mRNA expression seen in the renal carcinoma samples. 
This was confirmed at the protein level using Western 
blotting. Kidney tumor tissue arrays confirmed Fzd5 
membrane staining in 30% of clear cell carcinomas, with 
nuclear cyclin Dl showing a strong correlation with the 
Fzd5 membrane labeling. Fzd8 protein expression analy- 
sis was not performed due to the lack of suitable reagents. 
These data suggest that Fzd5 may have a role in renal cell 
carcinogenesis due to its frequent overexpression ob- 
served in these tumor samples. Potential future applica- 
tions could include uses in tumor targeting or as a poten- 
tial biomarker. 



Materials and Methods 

Tissiie Samples 

Fro2cn tumor tissue samples wiih corresponding normal tissue 
from the same patient were derived either from human biopsy or 
autopsy material {Department or Pathology, University of Antwerp, 
kindly provided by Prof. E. Van Marck). Tissue specimens were 
snap-frozen in liquid nitrogen and kept at ~30*C unii! use. Frozen 
sections pr kidney tumor and normal tissue samples were stained 
with bcmatoxylm-eosin to support the pathologist's observations and 
to confirm ihetypc or kidney tumor. Paraffin-embedded I issue slides 
of rensl carcinoma. lung carcinoma, breast and colon carcinoma 
were obtained, after encryption, from lhe Department of Paihology 
(Middelheim Hospital. Antwerp. Belgium). The CU human kidney 
cancer (Super BioChips Laboratories) tissue array used in I his study 
contained 59 tissue samples consisting of 9 normal kidney tissues. 



30 clear cell renal carcinoma samples and another 20 renal cell tumor 
types (chromophiK chromophobe, papillary type* collecting duct car- 
cinoma and samples with rimed types). 

RtfA Isolation and Reverse Transcription 
Total RNA was extracted from tissue specimens using LOtraspec 
Reagent (Biotecx. USA) according to the manufaciurcr/s instruc- 
tions. All total RNA was routinely treated with DNase (DNA-frec kit, 
Ambion, USA). I ug of total RNA was used to synthesize cDNA 
using oiigb-dT primers (Superscript; Invitrogen, Merelbeke, Bel- 
gium). Reverse transcription was performed at 42 *C for 60 min, loir 
lowed by 70 p C.for 1 0 min. 

Real-Time PCR . 

ReaMime PCR was performed on either an AB J Prism 7700 or 
7900 Sequence detection system (perkin-Elmer Applied Biosystcms, 
Foster City, Calif., USA) using the 5' nuclease assay (Taqrnan™). 
Primer and probe sequences were designed using Primer Express (PE 
Applied Biosystcms) and are shown m table I. Quantitative values, 
were obtained from the threshold cycle number (Ct) at which the 
increase in the signal associated with exponential growth of PCR 
products is detected using PE Biosystcms anarysis software, accord- 
ing to the manufacturer's instructions. . - V 

We have used the 2" A AO method to analyze the relative changes 
in gene expression of the different genes between tumor and corre- 
sponding normal tissue samples. We used the mitochondria! ATP 
synthase 6 (ATPsy6) as the endogenous RNA control [ 1 0; Janssens ct 
a)., in prep J, and each sample was normalized to its ATPsy6 content. 
The relative expression of the target gene was also normalized to the 
corresponding normal tissue sample (calibrator). Results, expressed 
as the amount of target sample relative to the ATPsy6 gene and the 
calibrator, were determined as follows, N = 2-» a ° Mm P k - aontfcrwotf 
where the AO values of the sample and calibrator were, determined 
by subtracting the average Ct value of the sample and the calibrator 
from the average Cl value of the ATPsy6 gene. Amplification was 
done essentially as described previously f 10). Briefly, 50 pi of reac- 
tion mixture containing 1 pi of cJDNA template were amplified as 
follows; incubation at 50* C for 2 rain, denaiurarion at 95 *C fox 
1 0 min, and 50 cycles at 95 ° C for J 5 s and 60* C for J m in. 

Membrane Preparation, Gel Electrophoresis and Imrnvnoblotting 
Tissue samples were weighed, suspended at a 40 times dilution 
|= 40 volumes/original wet weight of tissue (Ww)] in 50 mM Tris- 
HCI buffer. pH 7.4, and homogenized with an UHra-Turrax homoge- 
nizes After centrifugaiion for 10 min, 24,000$ at 4*C, the pellet was 
washed three limes by resuspension in the Tris-HO buffer followed . 
by centrifugaiion. The final membrane pellets were stored at -80°C 
in the Tris-HCI buffer at a concentration of 0.5-1 mg/ml. The Brad- 
ford protein assay (Pierce, Aalst, Belgium) was used for protein deter- 
mination. Proteins (50 ug) were separated, by 8% SDS^PAGE and 
transferred to nitrocellulose membranes. After primary and second- 
ary antibody incubation, the anrigen-aritibody-peroxidase complex 
was detected by cheroiiuminescence (Pierce, Aalst, Belgium) accord- 
ing to the manufacturer's instructions. 

Immunohistochemistry 

Immunohisiochemistry was performed on 10-ujVAihick cryoscc- 
lions of unfixed tumor tissue and on 6-nA/- thick para ITm "sections 
from renal tumor tissue fixed by formalin or by an alcohol-based fixa- 
tive. Adjacent tissue blocks from renal tumors were processed with 
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Table 1. Real-time PCR primer and probe sequences 



Target cDNA 


! Primer/probe sequences* 


Fragment 
position b 


Accession 
No* 


FZD2 


(a) 5*-alcccEtccecE£c-3* '" 

(b) 5'-giatttg3icatglagatcgtgaagtc-3' 

(c) 5'*FAM-Iacaegccgcgcalglegc-TA M RA-3* 




# 


FZD3 


(aV 5'-tcac**cca ct pcatp&p. V 

(b) 5'»ttgicaccilcaat»ultcatcg-3' 

(c) 5*-FAM<atccccggaactciaaccaicatcctm-TAMRA-3 # 


!/•/ J—J 


/\D\Jjy IJ.J 


F2D3 


(a) 5*-l fcCcaapelraetlccfriir-'V 

(b) i'-tcictaagicgccgegO*. 

(c) 5^FAM-ccttc3lggigclgilgccccc-TAMRA-3' 




• rtoU*MJl o 


FZD6 


(b) 5'-cccagagagiciggagalggatO' 

(c) 5'-FAM-tglg£ti&aacctgcclcgccag-TAMRA-3' 


2,094-2, J 70 


AF072873 • 


F7D7 


(a) 5'-cctgjgg3aaggcataacigtg-3' 

(b) 5 -aaccaacgggaaacttcaga-3' 

(c) 5'-FAM-aagcoacilttaiaggcaaagcagtgcaa-TAMRA-3' 


2,687-2,762 


AB017365 


r/X/o ■ 


(a) 5*-ig!ggtcggtgclctgcU-3' 

(b) 5'-cgctccaigtcgataaggaag-3' 

(c) 5'-FAM-ccacct tcgccsccgtcEcca-TAMRA-i' 


853-919 


" AB043703 


FZD9 


(a) 5'-ccccgggagctacggacO' : 

(b) 5 -lagiC3lgtgc3agaccacge-3* 

(c) 5vFAM-tggc3Cgcactgccaclataaggct-TAMRA-3'. .. . 


J,696-J,763 


HSU82I69 


ATP$y6 


(a) 5'-ggtgtaggtgigcct!gigg|-3' 

(b) 5'-gggcgcaglganataggettO' 

(Oi'-FAM-aagigggclagggcaniiiaatcitagagc^TAMRAO' 


580-503 


AF36827I 


c-myc 


(a) 5'-accaccagc3gcg3elciga-3' 

(b) 5Mceagcagaaggtgatccagacl-3' 

(c) S^FAM-Dcctuigccaggagccigcctci-TAMRAO' 


1^297— J 413 


HSMYCl 


Cyclin Dl 


(a) 5'-gaacctggccgcaaigac-3'- 

(b) 5'-egectciggcailiiggaO' 

(c) 5'-FAM-ccgcacgaitica»g3acacii-TAMRA-3' 


4,143-4,211. 


AF5! 1593 


PPAR5 


(a) S'-agcatectcaecggcaaaO' 

(b) 5'-gtctcgaigicgiggotcaca-3' 

(c) 5'- FAM-ccagccacacggcgcccl-TA MR A-3' 


932-990 . 


NM-006238 . 



a (a) = Sense primer, (b) = anlisensc primer, (c) = probe. 

b Fragment positions are given according to the EM BL/GenBank accession No. of doned sequence. 
. c EMBUCenBank accession No. of cloned sequence. 



formalin and with the alcohol-based fixative. Paraffin and cryosec- 
tions wctc mounted on poly-I-lysine or 3-aminopropyltrietboxys>- 
lane-gtlaiin-coated slides. The 59 tissue samples on the CLI human 
renal cancer tissue array slides were all fixed with formalin and 
embedded in paraffin, and the sections were mounted on silane- 
coatcd slides (Super BioChips Laboratories). In addition to rena) car- 
cinoma tissue, sections from 10 formalin- fixed pa rafTm-em bedded 
lung carcinomas were stainrd for Fzd5. CoJon and breast tumors 



were used as positive controls for 0-caienin and cyclin Dl immuno- 
staining. 

The following primary antibodies were used: Fzd5 (Upstate Bio- 
technology), p-catenin (Zymed), cyclin DJ (Zymed). E^cadherin 
(Novaccstra) and cytokeratin S (Biogenex). Cryosections were fixed 
in 4% paraformaldehyde for 5 min; acetone for 5 mm at -20*C and 
70% ethanol for 5 min. Endogenous peroxidase, activity was 
quenched using 3% H;CK Paraffin sect ions of formalin- and alcohol- 



Friz2led Receptors in Renal Cell Carcinoma 



Tumor Biol 7004;? X 161-171 



163 



Table 2. F2d mRNA expression in tumor samples 



Sample Tissue 



Tumor rype 



x-fold expression increase** 



FZD2 



F2D3 FZD5 FZD6 F2D7 



FZD8. 



F2P9 



133702 kidney - adenocarcinoma 

137770 kidney renal cell carcinoma 

138844 kidney renal eel) carcinoma 

137146 kidney renal cell carcinoma 

137564 kidney . renal cell carcinoma 

.133408 kidney renal cell carcinoma 

139188- kidney renal cell carcinoma 

13569? - kidney renal cell carcinoma 

139064 kidney . renal cell carcinoma 

134585 kidney renal cell carcinoma 

140279 kidney jena) cell carcinoma 



0.17 


0.5 


3J2 


2.13 


0.06 


1.32. 




1.23 




ft 16 


■ — -U I 


I. J 


-8.2 J 


X44 


0.31 


ft 1 1 

V, 1 I 


Q.Qt 


I Jo 


VUJ 


3.J8 


2.8 


047 


ft 4S 


J.JO 


1.13 


A "71 

0.73 


4.42 


1.37 






0 A 

7.0 


I.J / 


7.64 


3.52 




23.97 


0.98 




7 Aft 




J 0.72 




3.7 


0.56 


0.66 


1.33. 


0.37 


4.41 


K53 


2.6 * 


0.36 


.4.83 


6.9 


0.34 


2.54 


2.31 


536 


1.82 


\25 


2.36 


1.8 


2.19 


2.85 


1.47 


0.72 


1,38 


0.47 


0.09. 


04 




7.33 


J8J7 


3.93 


6.05 


6.41 


4.tf5 




0.4 


339 


0.49 


0.44 


0.92 


0.54- 


0.33 


0.7 


5,17 


122 


2.19 


o;4 


0.11 




0.39 


3.29 


236 


1.34 


3.94 


.0.67 


0.5? 


3.45 


3.36 


0.58 


1,22. 


1.67 


124 


1.99 


5.46 


634 


4.42 


6.57- 




6.08 


0.15 


4.01 


433 


0.3 


1.16 


0.54 


0.55 


0.12- 


1.87 


1.07 


1.62 


1.47 


0.51 


2.07 




0.66 


1.4! 


1.01 


1.33 


024 


0.79 . 


4.87 



137252 
138256 
146472* 



ovary 
ovary 
ovary 



carcinosarcoma • 
papillary carcinoma 
serous papillary carcinoma 



145845 
146145 
146630 
146633 
147055 



colon 
colon 
colon 
colon 
colon 



adenocarcinoma 
adenocarcinoma . 
adenocarcinoma 
adenocarcinoma 
adenocarcinoma 



142253 lung adenocarcinoma 0.67 5. J 7 0.43 

143036 lung adenocarcinoma 0.67 1.24 2.63 

138938 lung adenocarcinoma 0.93 0.88 1.07 

133563 lung adenocarcinoma 2.76 1.23 0.31 

144387 Jung •-. adenocarcinoma J7.85 0.43 0.49- 

137304 lung acinary adenocarcinoma 0.54. 9.15 1.65 

144546 lung epithelial carcinoma 0.1 0.67 0.21 

137621 lung epithelial carcinoma J.52 2-19 0.37 

145552- lung epithelial carcinoma |.09 - 0.J6 



. 1.87 
1.13 
1.3 
134 
0.25 
1.25 
8.1) 
226 
0.91 



033 

Ml 

1.73 

0.99 

2.73 

0.63 

0.27 

0.52 

1,47 



0.59 
1.04 
0.38 
0.78 
0.54 
3.4 J 
0.97 
0.45 
0.13 



0.41 
9.19 
0.76 
4.97 

2.64 
IJ2 
1.06 
0.5 



J43987 testis embryonal carcinoma 0.24 0.6 

137332 stomach leiomyoma 19.32 3.9 J 

139026 stroma gastrointestinal carcinoma 66.1 331 

136049 rectum adenocarcinoma . J.7J 0.52 

140794 gallbladder adenosquamous carcinoma 0.19 



19.43 
0.45 
0.03 
0.4 
0.93 



1.33 
1.53 
6.67 
0.54 

3.02 



2332 
18.98 
8.44 
0.29 
0.2 



2.43 
32.82 
2.28 
0.88 



033 
16.7 
20.82 

02Z 



* Sample identification numbers were given by the pathologist. 

Results are expressed as x-fold increase of the gene in the tumor tissue sample compared to its matched normal tissue sample after 
normahzing both samples on the basis of their ATPsy6 content. A cuio/T of 3-fold was used to define differentia) expression. Significant 
<> 3-fold) mcreascs m the expression level of the Fzd receptor are shown in italics. - = Expression of the target gene undetectable in one or both 
. samples (tumor and/or normal). 



fixed tissue were processed with a trypsnveil rate- microwave pre- 
treatment or with an £DTA- microwave preireaimem to unmask epi- 
topes, respectively. Sections were then sequentially processed with 
primary antibodies, biptinytoted secondary antibodies and slreptav- 
idin-bioiin-peroxidase (F 2 d5. £-cadherin and cyiokeratin 8). For 
catenin. polyclonal rabbit antibody with the EnVision detection sys- 
tem (DAKO) was used. The slides were further developed using~3- 
amino-9-cihylcarba20le. countersiained with hemalaun and mount- 



ed with glycerin gelatin. Stained sections were observed with an 
Axioplan 2 microscope equipped with an Axiocam digital camera. 
Staining intensity for p-catenm was scored as no staining (value 0), 
weak and fragmentary staining of cell membranes (value 1), moder- 
ate membrane staining of less than 50% of the tumor cells (valuc-2). 
moderate membrane staining of more than 50% of tumor eeJIs (value 
3) and strong membrane staining of more than 75% of tumor cells 
(value 4). The cydin Dl staining was quantified as a percentage of 
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F»9* 1. Fid5 protein expression in matched 
iwnor/norma) kidney samples. T = Tumor 
sample; N * matched normal sample. Sam- 
ple identification numbers are given by the 
pathologist. 
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cyclin DMmmunoreactWe nuclei in tumor cells in three fields (area: 
J 8,64 1 pm J ) of each tumor sample: The total number of tumor nuclei 
ranged from 5 1 to 164. The correlation between Fzd5 and 0-caienm 
staining and between Fzd5 and cyclin Dl staining was evaluated by 
ihe Mann-Whitney U test. 



sion level was observed to be relatively low in these hmg 
tissues compared to the oiher tissues investigated. 



Resuhs 

Fzd mRNA Expression in Matched Human 

Tumor/Normal Tissue Samples 

Fzd expression in tumor tissue was compared with Fzd 
expression in matched normal tissue samples and normal- 
ized to the expression of the housekeeping gene mitochon- 
drial ATPsy6 (table 2). A 3-fold increase was considered 
significant. . 

In the kidney tumor samples, in which 10 of 1 1 sam- 
ples were clear cell carcinomas, Fzd5 was upregulaled in 8 
of the I J samples. A similar observation was made for 
Fzd8 and Fzd2, which were upregulated in 7 and 5 renal 
tumor samples, respectively. None of the other Fzds 
showed consistent upregulation. 

Both F2d2 and Fzd3 were upregulaled in 3 of 5 colon 
adenocarcinoma samples. No other Fzd expression was 
significantly different compared to Ihe normal colon tis- 
sue sample. Fzd3 showed an increased expression in all 3 
ovarian carcinoma samples. Fzd expression was not al- 
tered in any of the lung tumor samples. The Fzd expres- 



W est em Blot and J mmunohistochemistry Analysis on 
Renal Carcinomas 

Western blotting was used to evaluate Fzd5 protein 
expression in the renal tissue samples used for mRNA 
expression analysis. Membrane fractions of the renal car- 
cinoma and corresponding normal tissue samples were 
prepared. As previously shown (table 2), Fzd5 mRNA 
upregiilation was detected in 8 of the 1 1 matched tumor/ 
normal samples. Increased expression of Fzd5 protein 
was seen in membrane fractions from 9 of 1 1 samples 
(fig. 1). In most cases, concomitant increases in Fzd5 
mRNA and protein levels were observed. 

Hematoxylin-eosin staining of the cryosect toned' tu- 
mors confirmed the presence of clear cell carcinoma, 
Fzd5 immunostaining in clear cell carcinoma (fig. 2a, b) 
was observed to be localized to cell membranes and to 
nuclei. Cytokeratin 8 (fig. 2c) and E-cadherin {fig. 2d) 
were also detected. E-cadherin labeling of cell membranes 
in clear cell carcinoma was less intense and patchy com- 
pared to epithelial cells Of normal renal tissue. P-Catenin 
staining was confined to Ihe cell membrane. 0-Catenin 
levels in the clear cell carcinoma membranes were highly 
variable. Nuclear 0-catenin staining was not observed in 
any of the samples. Epithelial cells in nprmal renal. tissue 
showed intense membrane staining and some cytoplasmic 
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Staining. Id addition, weak p-catenm staining of endothe- 
lial cells was observed. A high number of cyclin Dl- 
immunoreactive nuclei was observed in clearcel) carcino- 
ma (fig. 2f). 

On the CL1 human kidney cancer tissue array, 30% 
(n = 9) of the dear eel) carcinoma tumor samples (n = 30) 
showed Fzd5 immunoreactivity (fig. 3a), Membrane-a&- 
sociated {Vcatenin staining was observed in 33% of the 
Fzd5-positive tumor samples and 57% of Fzd5-negative 
clear renal cell carcinoma samples (table 3; fig. 3c, d). 
Again, nuclear p-catenin staining was never observed. 
Statistical analysis did not reveal a diJTerehce in the 
expression of 0-catenin between Fzd^positive and Fzd5- 
negative tumor samples (fig. 4a). 

Nuclear cyclin DJ. was observed in.89% of the Fzd5- 
positive clear cell carcinoma sampled (table 3; fig.3e). 
Only 38% of the Fzd5-negative clear cell carcinoma sam- 
ples; contained nuclear cyclin DK Statistical analysis 
showed a significantly higher cyclin Dl expression in 
Fzd5-positive compared lo Fzd5-negative tumor samples 
(fig. 4b). 



Expression of PPAR5 was investigated because it repre- 
sents a direct target of the Jkralenin pathway with T cell 
factor binding sites in its promoter. Expression of c-mvc 
was found to be upregulated in 7 of II, whilst cvdin Dl 
was upregulated in 10 of 1 1 kidney tumor samples (ta- 
ble 4). PPAR5 was upregulated in 9 cases. AU three Select- 
ed target genes showed a marked upregulation in . the 
majority of renal tumors, which Suggested that the Wnt/ 
P-catenin pathway was activated in these samples. 

Discussion 



c-mya Cyclin DJ and Peroxisome 
Ptoliferalor-Aciiyated Receptor S Expression in Renal 
Carcinomas 

Wnt/p-catenin.pathway activation in the kidney tissue 
samples was investigated looking at the expression of a 
number of target genes, which have previously been 
shown to be upregulated when the pathway is active. 
Gene expression of c-myc, cyclin Dl and peroxisome 
proliferator-activated receptor 5 (PPAR5) was analyzed. 
Increased expression of both c-myc and cyclin Dl genes 
have been implicated in cell proliferation, and carcino- 
genesis, and they represent two of the more important and 
closely studied target genes of the Wnt signaling pathway. 



Rg. 2. Disinbuiion of Fzd5 (a, b), cytoktratiji Z (c), E-cadherin <d) 
and p^atcnm {*) immunoreactivjty in para/fin sections from a renal 
lumoj processed by an alcohol fixative. From the same tumor a fcr- 
malm-fixed block was used for cyclin DJ iromunosiaining (#). Fzd5 
immunosiaimng shows distinct immunoreaciiviiy i„ cell membranes 
and in nuclei of clear cell renal carcinoma. Gear cells are immunore- 
activeforcytoJceratin 8.0-Catenin and E-cadherin staining of mem- 
branes is rather weak, and not uniform, in clear cell renal carcinoma. 
Nuclear 0-catenin immunoreaciiviiy Wa s not observed. In clear cell 
renal carcinoma, many nuclei showed cyclin Dl immunoreaciiviiy 
nie inset ml shows a detailed view of ihecydin Dl labeling of nuclei 
»n clear celj renal carcinoma. 



Fzd family member overexpressjon has been postu- 
lated to play key role.s in different tumor types such as 
esophageal carcinoma [1 J J, gastric cancer [12] and head 
and neck squamous, cell carcinoma [13]. The current 
study evaluated the potential implication of Fzds as 
tumor-associated antigens in different tumor types. We 
screened a number of matched normal/tumor tissue sam- 
ples for the expression of a variety of Fzds using real-time 
quantitative PCR. 

Results obtained revealed that both Fzd5 and Fzd8 
mRNA were overexpressed in the majority of renal carci- 
noma samples when compared to the matched normal 
kidney samples. Fzd2 and Fzd3 were upregulated in 3 of 5 
colon adenocarcinoma samples. Fzd3 was also upregu- 
lated in the ovarian tumor tissue samples compared to the 
matched normal tissue samples. None of the other Fzds 
evaluated showed a specific differential expression pat- . 
tern in any of the samples studied. Fzd5 and Fzd 8 show 
69.1% similarity and belong to the same subgroup of Fzds 
[14]. The significantly higher expression of Fzd5 and. 
Fzd8 in the renal tumor samples, as compared to the nor- 
mal renal samples, suggests a higher probability that this . 
subgroup may be implicated in the progression of renal 
cancer. Therefore, we decided to further examine the pos- ' 
sible role of Fzd5 in renaJ carcinoma. 

We observed, using Western Wotting, that protein lev- 
els were mostly consistent with mRNA levels in the tumor 
samples. Jn order to be able to determine the Fzd5 expres- 
sion in a broader range of kidney tissues, we utilized a 
tissue array. Fzd5 membrane staining was detected in 9 of 
30 (30%) clear cell carcinomas, and importantly, mem- 
brane staining was not detected in the matched 9 normal 
. kidney tissue samples. 

Since the Wnt signaling pathway appears to play an 
important role in embryonic development, in particular 
embryonic kidney induction [ J 5, 16], activation of this 
pathway in the adult kidney due to mutation or overex- 



Fn"22»ed Receptors in Renal Cell Car. 



Tumor Biol 2004^3; 161- 17| 



167 




50 jirffc 




50 tfm . 




Array sample 45 



50 jim 



Tumor Biol 2004:?5; 161-17) 



Jansscns/Andrics/Jankol/Pcrera/Baickcr 



Tabic 3. Correlation between Fzd5 and 
0-catenin or cyclin Dl expression 



Fzd5,% 



frCareriin.+ 
^Catehin- 
Cydin Dl + 
CycKn Di - 



33 
.67 
89 
II 



57 
43 
38 
62 



pression of one of the components of (he pathway could 
be a determining factor in the development of renal can- 
cers Therefore, several studies have looked into the possi- 
ble function (he Wnt/^catenin pathway plays in renal 
carcinogenesis. AFC gene mutations have been demon- 
strated not to be involved in renal carcinom'a [17,18] In 
addition, 0-catenin mutations are rare events in renal car- 
cinoma [19, 20], Nevertheless, cytoplasmic accumulation 
of p-catemn has.been reported in a number of renal cell 
carcinomas [19], and thus the Wnt signaling pathway 



Sample Tissue 



Tumor type 



x-fo)d expression increase*' 



133702 

137770 

138844 

137146 

137564 

13340B 

139188 

135699 

139064 

1345*5 

140279 



kidney adenocarcinoma 
kidney renal cell carcinoma 
Icidney renal cell carcinoma 
kidney renal cell carcinoma 
kidney renal cell carcinoma 
kidney renal cell carcinoma 
kidney renal ce» carcinoma 
kidney renal cell carcinoma 
kidney renal cell carcinoma 
kidney renal cell carcinoma 
>*dney renal cell carcinoma 

; ^mpic i dcm)75ca| ; numbers were given bv thepathoic^T 

^ifsmiS 

XTTsy6 content. A cutoff ^lafused" define M ^ °" ^ 0fth ™ 
>^«c«„* the session 



c-myc 


cyclin Dl 


WAR* 


0.54 


4.52 


0 r 52 


J3.9 . 


28.91 


5.53 


2J9 


31.49 


7.48 


7.62 


1538 


3.15 , 


33.82 


1965 


8.65 


7.8 


8.92 


4.86 


222 


9.92 ~ 


1 1.67 


12.18 


22.73 


5J3 . 


22.1/ 


5.04 


6.33 


1.79 


1.14 


1.67 


6).68 


54.95 


14.62 



0 



8 



b 60 

g 50 
"3 

c 

"g 30 
.? 20 

J »• 

o 

d 0- 
-10 J 



- P < o„ot ■ 



CD 



Fad5- 
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Fig. 4. Box plot charts (thick black line * 
median) illustrating the relationship be- 
tween Fzd5 immunostaining and 0-catenin 
(») and cyclin Dl (b) expression in clear cell 
renal carcinoma. No significant correlation 
was observed bet ween the f^caienin scores of 
Fzd5-positive and -negative clear cell renal 
carcinoma. Nuclear cyclin. Dl staining in 
clear cell renal carcinoma showed 3 signifi- 
cant. di/Tereoce between FzdJ-posmve 3nd 
F2d5-negative tumor samples. 
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might act as an inducer of tumorigenesis in the kidney, nin was not observed in any of the tumohor on the tissue 

This view is supported by the observation that aberrant array. This result is consistent with the data presented for 

activity of the Wnt signaling pathway has been reported in renal cell carcinomas by Kim et al. (I9J. They did not 

renal-cancer-derived eel) lines. Zang et al. [2 1 J observed a detect nuclear 0-catenin staining in the 52 renal celJ carci- 

higher expression level of WntSa and Fzd5 mRNA in the nomas examined. The lack of nuclear p-catenin staining 

renal cancer cell line GRC-1 than in the normal renal cell has also been reported by others in tumors that might 

line HK-2. Expression of 0-cateniri was also higher in have arisen from Wnl/0-catenin pathway activation [28- 

GRC-1 than in 31]. 

To determine the status of the canonical Wnt signaling While expression of both Wnt5a and F2d5 does induce 
pathway in our renal carcinoma samples, we have quanti- duplication of the Xenopus head, exogenous expression of 
tated the mRNA levels of three important target genes of Fzd5 in a Xenopus model does not induce duplication of 
T cell factor/Jymphoid enhancer factor activation by the head {32]. Fzd5 does not activate the 0-catenin signal- 
0-catenin. The mRNA levels of these three target genes ing pathway on its own, as the presence of its endogenous 
(c-jnyc, cyclin Dl and PPAR 5) correlated largely with the ligand is also required. Our results suggest that F2d5 may 
expression, of F2d5 in these samples, suggesting that the have a role in renal cell carcinogenesis due to its frequent 
.canonical pathway is activated. On the kidney tissue overexpression observed in these tumor samples, and we 
array, cyclin Dl protein expression showed a highly sig- hypothesize that if Fzd5 is overexpressed, it has a rather 
nificant correlation with the Fzd5 expression in the tumor limited effect, on p-ratenin signaling; However, in the 
samples (table 3). Cyclin Dl protein is frequently. overex- presence of its endogenous still unknown ligand, it acti- 
pressed in various tumors, but in only a. proportion of the vates the canonical Wnt signaling pathway. The eJucida- 
casesis it due to amplification of (he cyclin Dl gene[22J. tion of this ligand and its binding characteristics is still 
Therefore, other mechanisms such as upregulatiori of gene under investigation. Ultimately, knowledge of the specific 
. transcription may play a substantial role in the overex- expression patterns o'f both Wnt and Fzd members could 
pression of cyclin Dl [23-26]. Our data, showing in- lead to directed tumor targeting or could be used as a 
creased cyclin Dl expression in renal carcinoma samples, tumor marker, 
are consistent with the results of Stassar et ah [27], They 
studied genes that are associated with human renal carci- 
noma by suppression subtractive hybridization and re- Acknowledgment 
ported 14 differentially expressed genes, including cyclin 

DL Although we would have expected an increased nu- We aic * ra,cfuI 10 Prof E. Van Man* (University of Amwezp) Tor 

clear ^catenin staining, nuclear accumulation of p-cat* providine * With ,hc *¥>*"™** 
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IrninunoIustocJiemica! analysis of NY-ESO-1 antigen expression in normal 
and malignant human tissues. 

Jungbluth AA , Chen YT , Stockert E , Busam KJ , Kolb D, Iversen K , Coplan K , 
Williamson B , Altorki N, OjdMLJ. 

Ludwig [nstitute for Cancer Research, Memorial Sloan-Kettering Cancer Center, New 
York, NY, USA. jungblua@mskcc.org 

NY-ESO-1, a member of the CT (cancer/testis) family of antigens, is expressed in normal 
testis and in a range of human tumor types. Knowledge of N Y-ESO-1 expression has 
depended on RT-PCR detection of rnRNA and there is a need for detecting NY-ESO-1 at 
the protein level. In the present study, a method for the immunochemical detection of 
NY-ESO-l in paraffin-embedded tissues has been developed and used to define the 
expression pattern of NY-ESO-1 in normal tissues and in a panel of human tumors. No 
normal tissue other than testis showed NY-ESO-1 reactivity, and expression in testis was 
restricted to germ cells particularly spermatogonia. In human tumors, the frequency of 
NY-ESO~l antigen expression corresponds with past analysis of NY-ESO-1 rnRNA 
expression e,g., 20-30% of lung cancers, bladder cancers and melanoma, and no 
expression in colon and renal cancer. Co-typing of NY-ESO-1 antigen and rnRNA 
expression in a large panel of lung cancers showed a good correlation. There is great 
variability in NY-ESO-1 expression in individual tumors, ranging from an infrequent 
homogeneous pattern of staining to highly heterogeneous antigen expression. Copyright 
. 2001 Wiley-Liss, Inc. 
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Multidrug resistance phosphoglycoprotein (AECBl) iu the mouse 
placenta; fetal protection. 

Kalabis GM, Kostaki A . Andrews Mil. Petropoulos S, Gibb W , Matthews SG 

Department of Physiology, University of Toronto, Ontario, Canada. 

The multidrug resistance phosphoglycoproteln ATP-binding cassette subfamily B 
(ABCB1) actively extrudes a range of structurally and functionally diverse xenobiotics as 
well as glucocorticoids.ABCB 1 is present in many cancer cell types as well as in normal 
tissues. Although it has been localized within the mouse placenta, virtually nothing is 
known about its regulation. In the mouse, two genes, Abcbla and Abcblb, encode 
ABCB L. We hypothesized that there are changes in placental Abcbla and Abcblb gene 
expression and ABCB1 protein levels during pregnancy. Using in situ hybridization, we 
demonstrated that Abcblb mRNA is the predominant placental isoform and that there are 
profound gestational changes in the expression of both Abcbla and Abcblb mRNA. 
Placentas from pregnant mice were analyzed between Embryonic Days (E) 9.5 and 19 
(term approximately 19.5d). Abcblb mRNA was detected in invading trophoblast cells 
by E9.5, peaked within the placental labyrinth at E12.5, and dien progressively decreased 
toward term (P < 0.0001). Abcbla mRNA, although lower than that of Abcblb at 
midgestatton, paralleled changes in Abcblb mRNA. Changes in Abcbl mRNA were 
reflected by a significant decrease in ABCB 1 protein (P < 0.05). A strong correlation 
existed between placental Abeb 1 b mRNA and maternal progesterone concentrations, 
indicating a potential role of progesterone in regulation of placental Abcblb mRNA.' In 
conclusion, there" are dramatic decreases in Abcbla and Abcblb mRNA and in ABCB1 at 
die maternal-fetal interface over the second hklf of gestation, suggesting that the fetus 
may become increasingly susceptible to the influences of xenobiotics and natural steroids 
in the maternal circulation. 
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Expression of human telomerase reverse transcriptase gene and protein 
and of estrogen and progesterone receptors, in breast tumors: Preliminary 
data from nco-adjuvant chemotherapy. 
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Human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase is 
very closely associated with telomerase activity. Telomerase has been implicated in 
cellular immortalization and carcinogenesis, in situ detection of hTERT will aid in 
determining the localization of telomerase-positive cells. The aim of this study was to 
detect expression of hTERT mRNA, hTERT protein, estrogen receptor (ER) and 
progesterone receptor (PR) in paraffin-embedded breast tissue samples and to investigate 
the relat.onsh.p between hTERT expression and various clinicopathological parameters 
in breast tumongenesis. We used in situ hybridization (ISH) to examine hTERT gene 
e Xp ress.on, and I immunohistochemistry (IHC) to examine expression of hTERT protein 
ER and PR, m breast tissues including 64 adenocarcinomas, 2 phyllode tumors and their 
adjacent normal breast tissues. hTERT gene expression was detected by ISH in 56 ,88%) 
carcinomas but in neither of the 2 phyllode tumors. hTERT protein expression was 
detected by IHC ,n 52 (81%) carcinomas, but in neither of the 2 phyllode tumors 
Moreover, ER and PR were expressed in 42 (66%) and 42 (66%) carcinomas 
respectively, and in neither of the 2 phyllode tumors. In 4 cases of breast carcinoma that 
strongly expressed hTERT gene and protein before treatment, neo-adjuvant 
chemotherapy led to disappearance of gene and protein expression in all cases There was 
a strong correlation between detection of hTERT gene expression by ISH and of hTERT 
protein by ICH in tissue specimens from breast tumors. These results suggest that 
detection of hTERT protein by rCH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 
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Abstract. Human telomerase reverse transcriptase (hTERT), 
the catalytic subunil of telomerase, is very closely associated 
with telomerase activity. Telomerase has been implicated in 
cellular immortalizaLion and carcinogenesis. In situ detection of 
hTERT will aid in determining the localization of telomerase- 
positive cells. The aim of this study was to detect expression of 
hTERT mRNA, bTERT protein, estrogen receptor <ER) and 
progesterone receptor (PR) in paraffin-embedded breast tissue 
samples and to investigate the relationship between hTERT 
expression and various clinieopathologica! parameters in breast 
rumorigenesis. We used in situ hybridization (1SH) to examine 
hTERT gene expression, and imrnunohistochemistry (1HO to 
examine expression of hTERT protein, ER and PR, in breast 
tissues including 64 adenocarcinomas. 2 phyllode tumors and 
their adjacent normal breast tissues. hTERT §ene expression 
was detected by 1SH in 56 (88%) carcinomas, but in neither 
or the 2 phyllode tumors. hTERT protein expression was 
delected by JHC in 52 (81%) carcinomas, but in neither of 
the 2 phyllode tumors. Moreover. ER and PR were expressed 
in 42 (66%) and 42 (66%) carcinomas, respectively, and in 
neither of the 2 phyllode rumors. In 4 cases of breast carcinoma 
lhat strongly expressed hTERT gene and protein before 
treatment, neo-adjuvant chemotherapy led to disappearance of 
gene and protein expression in all cases. There was a strong 
correlation between detection of hTERT gene expression by 
JSH and of hTERT protein by 1CH in tissue specimens from 
breast tumors. These results suggest that detection of hTERT 
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protein by 1CH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 

Introduction 

Breast cancer is the most frequent malignancy in women, 
affecting up to one in every eight females worldwide. The 
most important clinieopathologica! prognostic parameter so 
far identified is the absence or presence of lymph node 
metastasis, but the identification of further parameters for 
both lymph node-positive and -negative patients would 
facilitate an individually based risk-directed therapy (1). A 
promising emerging molecular marker is telomerase. a 
ribonucleoprotein enzyme complex, which when activated 
or upregulated allows tumor cells to escape from cellular 
senescence and to proliferate indefinitely (2). The human 
telomere is a simple repeal sequence of six bases (TTAGGG) 
thai is located at the ends of each chromosome (3). Telomeres 
are believed to protect against degeneration, reconstruction, 
fusion, and loss (4) and to promote the homologous pairing 
of chromosomes (5). The end-io-end chromosome fusions 
observed in some tumors may result from the loss of telomeres 
and may be partly responsible for the genetic instability 
associated with lumorigenesis. Telomerase catalyzes the 
synthesis of telomere DNA and facilitates cell immortalization 
through the stabilization of chromosomal structure (6-8). 
Although the expression of the human RNA component of 
telomerase (hTERC) is widespread, the restricted expression 
pattern of the mRNA of hTERT. the human telomerase 
catalytic subunit gene, is correlated with telomerase activity 
(8-13). As telomerase activity seems to be the key player in 
tumor ceil immortality, it has importance as a target molecule 
for anti-cancer therapy. Telomerase activity has been shown 
to correlate with poor clinical outcome in neuroblastomas 
and other tumors (14). For breast cancer, however, telomerase 
activity is a controversial prognostic marker: some studies 
suggest that telomerase activity . clinicopathological parameter* 
and disease outcome are linked, whereas others do not find 
this associaiion (14-23). 
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We have succeeded in very clearly and sensitively demon- 
strating hTERT rnRNA in thyroid, colorectal, parathyroid 
and lung tissues by use of an oligonucleotide probe (13,24-26). 
Strong correlation has been observed between hTERT mRNA 
and^or protein expression and telomerase in a variety of 
malignant tumors (13.14,24.25,27,28)- In ihe present study, 
we used ISH to examine expression of the hTERT gene, and 
1HC to examine expression of hTERT protein, ER and PR, in 
64 carcinomas and 2 phyllodes rumors of breast to determine 
whether hTERT pjotein can be used to diffejentiate breast 
cancers. We' also analyzed hTERT mRNA and protein 
expression with special reference to clinical features and 
histological findings to investigate the potential role of 
hTERT mRNA expression analysis in predicting the biological 
characteristics of breast cancers. Since hTERT expression in 
breast rumors has not previously been analysed by ISH or IHC, 
our investigation also examined various clinicopathological 
parameters, including age, histopathplogical type, tumor size, 
lymph node status, relapses, and the expression of ER and PR. 

' Materials and methods , 

Tissue collection. Sixty-six samples were obtained during 66 
mastectomies: 64 breast carcinomas, 2 phyilodes tumors and 
66 specimens of the adjacent normal breast gland. In 4 cases. 
Samples were obtained during core needle biopsies (CNB) 
before neo-adjuvant therapy, and again during mastectomies 
after neo-adjuvant therapy; for these cases, all measurements 
and examinations were performed both before and after the 
neo-adjuvam therapy. The patients ranged in age from 32 lb 
90 years. The patients with carcinomas ranged in age from 37 
to 90, mean 56, years and were all women. The women with 
phyllode tumors were aged 32 and 38 years, respectively. 
The surgical and CNB tissue samples were frozen rapidly 
with liquid nitrogen and stored at -80*C until fixation. Then, 
they were fixed in 10% buffer formalin solution and 
embedded in paraffin. Surgical and CNB samples were 
collected from the patients after obtaining their informed 
consent, and the study protocol was approved by the Medical 
Department of the University of Tokyo Ethics Committee. 
The pathologic diagnoses were made by the surgical 
pathological specialists at our institute on the basis of 
examination of hematoxylin-eosin. stained slides. A 
pathological review was performed for all breast tumors 
according to the BRE score. pT and pN staging were assigned 
according to the 1997 WHO classification (7th edition). 

MCF-7 human breast cancer cells, kindly provided by the 
Cell Resource Center for Biomedical Research, Institute of 
Development. Aging and Cancer, Tohoku University, were 
used as positive controls. The cells were incubated in RPMl- 
1640 medium with 25 mM HEPES buffer. L-glutamine. and 
10% fetal . bovine serum (Gibco, Grand Island. NY) on a 
chamber-attached slideglass (Lab-Tek 3 Chamber Slide™: 
Nalge Nunc International. NaperviJic, IE) in a humidified 5% 
CO? atmosphere at 37"C. The cells were then fixed with 10% 
buffered neutral formalin (Sigma Chemical Co., St. Louis. 
MO). The cultured MCF-7 cell line that was used as a positive 
control was tested for telomerase wiih a PCR- based standard 
TRAP assay (6.13). These cells were also used ro prepare cell 
blocks. Briefly, the cells were fixed in 10% buffeied neutral 



formalin, resuspended in molten agarose and then embedded in 
paraffin. Sections from these cell blocks were used as positive 
procedural controls in ISH and IHC. The negative control in 
ISH was obtained by replacing the oligonucleotide probe with 
RNase. The negative control in IHC was obtained by replacing 
the primary antibody with Tris-buffered saline (TBS). 

Oligonucleotide probe for JSH* The specificity of the oligo- 
nucleotide sequence was initially determined by a GenEMBL 
database search using the Genetics Computer Group Sequence 
Analysis Program (GCG. Madison, WI) based on the fastA 
algorithm (29); the sequence exhibited 100% homology 
With the hTERT gene sequence. A d(T);o oligonucleotide 
was used io verify the integrity and lack of degradation of 
the mRNA in each sample. All oligonucleotide probes 
were synthesized with a hapten-labeled nucleotide, such as 
digoxigeninrdVrrP (Boehringer-Mannheim), at the 3" end via 
direct coupling by using standard pbospboramidite chemistry 
(Research Genetics, HumsvilJe, AL) (30). The probe used for 
detection of hTERT by ISH was generated from the original 
sequence for Homo sapiens telomerase reverse transcriptase 
(AF0 15950), 2766-2800: 5'-GCCTCGTCTTCTACAGGGA 
AGTTCACCACTGTCTT-3' (13,24-26). 
> 

ISH. ISH was performed with the GenPoint nucleic acid hyper- 
detection system (Dako; Carpenteria, CA) (31). Formalin- 
fixed, paraffin-embedded tissue sections (5 pm thick) were 
deparaffinhed in xylene and a graded alcohol series. Tissue 
sections and CNB samples were then pretreated with target 
retrieval solution (Dako, Si 700) at 95*C and proteinase K 
(Dako, S3004) at roorn temperature. Next; the tissues and 
CNB samples were fi*ed in 03% hydrogen peroxide followed 
by a methyl Blcobol series at room temperature. Digoxigcnin- 
labeled anti-sense oligonucleotide in mRNA in situ hybri- 
dization solution (Dako, S3304) was placed over the tissues 
and CNB samples. After hybridization at 37*C overnight, 
the slides were washed in stringent wash solution (Dako, 
GenPoint System Kit) at 45'C. The tissues and CNB samples 
were exposed to avidin blocking solution (Dako, X0590) 
at room temperature followed by biotin blocking solution 
(Dako, X0590) at room temperature. The tissues and samples 
were then incubated at room temperature with a sheep 
monoclonal hapten- labeled anti-digoxigenin antibody (Dako. 
p5i04), and the slides were then fixed with biotinyl tyramine 
(Dako, GenPoint System Kit) at room temperature. Finally, 
the slides were incubated with HRP-conju gated streptavidin 
(Dako, GenPoint System Kit) at room temperature. Since 
33'-diaminobenzidine leuabydrochloride (DAB) was used as, 
the substrate, a positive reaction was visible as a brown color 
under a light microscope. The sections were weakJy counter- 
stained with 0.1% hematoxylin. 

JHC. IHC was performed by the avidin-biotin complex/ 
horseradish peroxidase method. Tissue sections were stained 
for hTERT with a commercially available monoclonal antibody 
(NCL-L-hTERT; Novocastra, Newcastle upon Tyne, UK). 
Sections were dewaxed in xylene. Aniigen retrieval was done 
by incubating sections immersed in 0.0 J M citrate buffer at 
pH 6.0 in a miciowave oven at 99"C. The sections were 
allowed to cool down at room temperature. The sections 
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Table I. Relationships between mRNA status (negative/ 
positive) by 1SH and standard clinical, pathological, and 
biological facton in tbc 66 tumors. 



No. of patients (%) 



Total 
population 
(%) 



bTERT 
negative 



hTERT 
positive 



Figure J. Representative results of the TRAP assay. Jf functional lelomerasc is 
present, the enzyme adds DNA lo the substrate in 6-base-paii (bp) increments, 
resulting hi a ladder-like distribution of products. The 6- bp ladder signals art 
apparent /or MCF-7 and breast cancers (case nos. 6, 14, 23.44 and 59) and art 
not apparent for lysis buffer as the negative control. An m tract of MCF-7 was 
used as a positive control for the TRAP assay and as an Internal TeJornerase 
Assay Standard (JTAS) positive control for PCR amplification, with lysis 
buffer as the negative control (Lysis). 



were then immersed in 1% hydrogen peroxide (H 2 0 2 ) in 
methanol to block endogenous peroxidase activity. Following 
that, the sections were washed in TBS (pH 7.6) before being 
incubated in normal Tabbit serum for 20 min 10 block non- 
specific binding. After draining off (he excess serum, the 
sections were incubated with the primary antibody at room 
temperature. The sections were washed in TBS before being 
incubated with the secondary antibody (biotinylated rabbit 
anti-mouse, Dako). The sections were washed again with 
TBS and incubated with avidin bioiin complex/horseradish 
peroxidase. After washing the sections wjih TBS, peroxidase 
activity was visualized under light microscopy by applying 
DAB chromogen (Dako). The sections were counterstained 
with hematoxylin, dehydrated in increasing grades of alcohol 
and finally mounted in dibutyl pbthalate (DPX) moumant. 

Homogeneous staining or a speckled/dotted pattern in the 
nucleus was considered positive staining, and absence of 
distinct nuclear staining was taken as negative staining. 
Grading of the percentage of stained cells (hTERT labeling 
index) was performed by previously published criteria (I) as 
follows: Grade I, negative staining; Grade 2, 1-10% positive 
staining nuclei; Grade 3, 11-50% positive nuclei; and Grade 4, 
>50% positive nuclei. Jmmunbstained slides for ER and 
PR were scored as previously described (32,33). Jn brief, 
each entire slide was evaluated by light microscopy. First, a 
proportion score was assigned, which represents the estimated 
proportion of positive- staining tumor cells (0, none; 1 , < 1/100; 
2. 1/100 to 1/10; 3, 1/10 to 1/3; 4, 1/3 to 2/3; and 5. >2/3). 
Next, ari intensity score was assigned, which represents the 
average intensity of positive tumor cells (0. none; 1, weak, 
2, intermediate: and 3, strong). The proportion and intensity 
scores were then added to obtain a total score, which ranged 
from 0 to 8. Slides were sco;ed by pathologists who did not 
have knowledge of ligand binding tesuhs or patient outcome. 
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<50 


26 


6 (23 .» 20(765) 


>50 


40 


6(15,0) 34(85.0) 


Hisiopalho- 






logical type „ 






Scirrhous 


32 


2 (6.4) 30(93.6) 


PapillotubuJar 


20 


2(10.0) 18(90.0) 


Solid tubular 


6 


1 (16.7) 5(833) 


Mucinous 


2 


1 (50.0) .1 (50.0) 


Non-invasive 


4 


2(50.0) 2(50.0) 


PhyiJodes 


2 


2 (100) 0 (0) 


Tumor size 






(crny 






Tl (<2.0) 


20 


0 (0) 20 (100) 


T2 (2.0-5.0) 


34 . 


6(17.6) 28(82.4) 


T3 (>5.0) 


10 


2 (20.0) 8 (80.0) 



Lymph node 
status* 

pNO 

pNl 

pN2+pNM 
Relapse 



ER expression 
+ <*2) 

- (<2) 

PR expression 
-M*2) 

- «2) 



34 6(17.6) 28(82.4) 
26 2 (7.7) 24(92.3) 
4 0 (0) 4 (100) 



10 0 (0) 10 ()00) 

56 10(17.6) 46(82.4) 

42 6(143) 36(85.7) 
24 4 (16.7) 20<833) 

42 4 (9.5) 38(90-5) 

24 6(25.0) 18(75D) 



P-vaJue* 



NS 



NS 



NS 



NS 



NS 



NS 



NS 



°X ? test- NS. not significant. ^Information available for 64 patients. 



Statistical analysis: Differences in p- values were analyzed 
with the x 7 test for independence, and Fisher's test was used 
for correlations. In all comparisons, p-c0.05 was considered 
significant. 



Results 

Representative results of die TRAP assay are shown in Fig. t. 
The cultured cells, which were tested for teJomerase activity 




Figure 2. Conthmon beiw«n hisioJo^ic diagnosis, human lelomrraie stvtnt irawripiDSt (bTERT) mRNA by ISH. hTERT protein by IHC. esuo»cn 
receptor (EJt) by IHC and prostMercmc rrccpw (PR) by IHC in bftasi cant to. (a. H&E); (b. bTERT mRNA). <c, hTERT ptoicin); (d. ER) and {e. PR). 

JHC revealed thai hTERT prolein was strongly expressed 
in the nuclei, nuclear membrane and cytoplasm of almost all 
of ibe MCF-7 human cancer cells (data not shown). Expression 
of hTERT protein was delected in 52 (81%) of the 64 breast 
cancers and in none of the phyllode lomors of the breast 
(Table II)- The levels of expression were heterogeneous within 
the carcinomatous regions. As shown in Fig. 2c; strong 
expression of hTERT protein was observed, in nuclei, nuclear 
membrane and cytoplasm, similar to the pattern in MCF-7 
human cancer cells. Normal mammary gland and stromal 
cells generally showed negative immunoreactivily against 
hTERT pioiein antibody. 

A nuclear signal for the ER (Fig. 2d), as assessed by IHC 
was observed in 36 (569o) of the 64 breast cancers and in 
none of the phyllode tumors of the breast, with positive 
scores ranging from 2 to 8 (Tables I and J)). A nuclear signal 
for the PR (Fig. 2e). as assessed by JHC, was observed in 38 
(59%) of the 64 breast cancers and in none of the phyllode 
rumors of the breast, with positive scores ranging from 2 to 8 
(Tables I and II). 

We used ISH and IHC to examine hTERT expression 
in 4 oses of breast cancer before and after neo-adjuvaril 



with the TRAP assay, cave positive results with aJI procedural 
controls (MCF-7 and 5 bteast cancer samples) (Fig. I). 

JSH revealed that hTERT mRNA was sirongjy expressed 
in the nuclei and cytoplasm of almost all of the MCF-7 human 
cancer cells (data not shown). Expression of hTERT mRNA 
was detected ijv56 (88%) of the 64 breasl cancers and in none 
of the phyllodes tumors of the breast (Table I) with the anti- 
. sense probe, whereas no expiession was detected with the anti- 
sense probe treated with RNase (data not shown). The levels of 
expression were heterogeneous within the carcinomatous 
regions. Strong expression of hTERT mRNA was not confined 
lo the carcinomatous regions but was also delected in 
infiltrating lymphocytes (Fie. 2a and b). Higher expression 
levels of both signals of hTERT mRNA were detected in 
some sections containing both carcinomas and lymphocytes, 
compared with the adjacent non-cancerous mammary g!and. 
but no clear differences in signal intensity were observed 
between carcinomas and lymphocytes. The signals in both 
the normal and cancer (issues were mainly present in the 
lymphocytes, and the signal intensity was similar in both, 
although a precise quantitative comparison of the in situ signals 
was impossible. 



Table IK Relationships between mRNA status {negative/ 
positive) by IHC and standard clinical, pathological, and 
biological factors in the 66 tumors. 



Table III. Relationship of hTERT mRNA and protein 
expression before and after neoadjuvant chemotherapy. 

hTERT mRNA hTERT protein 



No. of patients {%) 

Total hTERT hTERT 
negative positive 



Total 

Age 
<50 
>50 

Histopaihological 

type 

Scirrhous 

papillorubular 

Solid tubular 

Mucinous 

Non-invasive 

Phyllodes 

Tumor size (cmf 
Tl (<2.0> 
T2 (2.0-5.0) 
T3 (>5.0) 

Lymph node status* 
pNO 
pN) 

pN2+pNM 
Relapse 



ER expression 
+ (^2) 

- (<2) 

PR expression 
+ (*2) 

- (<2) 



p-value* 



66 


14 (ZJ J.) 












NS 


26 


10(383) 


16(61.5) 




40 


4(10.0) 


36 (90.0) 










NS 


32 


2 (6.4) 


30 (93.6) 




20 


4 (20.0) 


16(80.0) 




6 


4 (667) 


2 (333) 




2 


0 - (0) 


2 (100) 




4". 


2 (50.0) 


2 (50.0) 




2 


2 (100) 


0 (0) 










NS 


20 


4 (20.0) 


16(80.0) 




34 


6(17.6) 


28(82.4) 






2 (20.0) 


8 (80 0) 










NS 


34 


6(17.6) 


28 (8214) 




26 


6(23.1) 


20 (76.9) 




4 


0 (0) 


4 (100) 










NS 


10 


1 (10.0) 


9 (90.0) 




56 


13(232) 


43 (79.6) 










NS 


42 


6(143) 


36(85.7) 




24 


8 (333) 


16(66.7) 










NS 


42 


6(143) 


36 (85.7) 




24 


8(33.3) 


16(66.7) 





IcM. NS. not significant. b lhformaik>n available foi 64 patients. 



chemotherapy. Before chemotheiapy , all 4 of the breast 
carcinomas strongly expressed hTERT by both ISH and 
IHC After chemotherapy. hTERT expression completely 
disappeared in all 4 cases (Table 111). hTERT expression by 
lymphocytes was detectable by ISH and IHC both before and 
after chemotherapy in all 4 cases, and the level of expression 
did not appear to be altered by tieatrnent. 

No correlation was observed between hTERT mRNA 
expression and any of the clinicopathoioeical parameters 
age. hisiopMhoIoeica) type, tumor lymph node status. 



Case 


Age 


Neoadjuvant. 


Before After 


Before After 
* 


1 


80 


Anastxozole 




+ 


2 


78 


Anastrozole 


+ 


+ 


3 


35 


FED 


+ 


+ 


4 


37 


AO 


f 





'FEC.5FU (500 mg/m^. Epirubicin (70 mg/m^. Cyclophosphamide 
(500 rog/m J ). *AC, Doxorubicin (60 mg/nV). Cyclophosphamide 
(500 mg/m*). Before, before neoadjuvant chemotherapy. After, 
after neoadjuvant chemotherapy. + . positive; negative. 



relapses, and the expression of ER and PR. Similarly, there 
was no correlation between hTERT protein expression and 
any of these clinicopathological parameters. There was a 
correlation between hTERT mRNA expression and hTERT 
protein expression in breast cancers (p<0.005). 

Discussion 

This study repons a comparison of hTERT mRNA expression 
by ISH and hTERT protein expression by IHC in tissue 
sections from breast tumors. hTERT mRNA was detected 
by ISH in 56 of the 64 breast cancers and in MCF-7 human 
breast cancer cells. Breast cancer cell nuclei stained strongly 
positive with the specific anti-sense probe but not with the 
anti-sense probe treated with RNase (data not shown). Tissue 
lymphocytes also stained positively with the anti- sense probe, 
but the stromal cells did not. Expression of hTERT protein 
was observed by IHC in 52 of the 64 breast cancers. hTERT 
mRNA and protein expressions were highly correlated m 
breast cancers (p<0.005). Detection of the hTERT protein by 
IHC has permitted further analysis of carcinogenesis and 
cancel diagnosis (34). 

In recent years, there has been disagreement over the 
suitability of telomerase activity as a prognostic biologic 
marker in breast cancer that may help to differentiate patients 
for individually based risk-related therapy. Hiyama tt al (15). 
in a study of 140 breast cancer specimens with the TRAP 
assay, found a strong association between telomerase activity 
and stage classification and observed telomerase activity 
in 68% of stage 1 tumors and 95% of stage IV tumors. 
Po/emba ei al(\), using tissue microanays, found a statistically 
siEnif*canl correlation berween rumor-specific survival (overall 
survival) and hTERT expression in breast cancer/However, 
some problems in interpretation may affect this apparent 
consensus. First, some samples of breast cancer tissue may 
be extensively contaminated by infiltrating lymphocytes 
during operative manipulations, especially in advanced disease, 
causing overestimate of telomerase activity and/or hTERT 
expression. In om previous repons. higher expression levels 



of signals for both hTERT mRNA and protein were detected 
in some sections containing both carcinoma and lymphocytes 
in thyroid and colorectal cancers (13,24). Secondly, Poremba 
et al ( 1) used polyclonal antibodies against bTERT protein as 
a signal for expression. In our bands, polyclonal antibodies 
against hTERT piotein give rise to strong background signals 
and art not clearly specific for rneasuring expression in cancer 
tissues. We have carefully compared the reactivity against 
hTERT protein of the monoclonal antibody osed in the 
present study with that of some polyclonal antibodies. Use of 
the monoclonal antibody in 1HC allowed clear demonstration 
of hTERT protein expression, with results similar to those 
of 1SH for hTERT mRNA expression. Furthermore, IHC is 
technically much easier to perform than 1SH, since con- 
tamination of samples by RNase is not an issue in IHC. 

To the best of our knowledge, this report is the first on the 
study of hTERT expression in breast cancer as a function of 
neoadjuvant treatment. We examined hTERT expression 
in 4 cases of breast cancer before and after chemotherapy. 
Before chemotherapy, hTERT was strongly expressed in 
all 4 carcinomas, but after chemotherapy hTERT expression 
had completely disappeared in all 4 cases. hTERT expression 
by. lymphocytes was detectable by 1SH and IHC both before 
and after chemotherapy in all 4 cases, and the level of 
expression did not appear to be altered by treatment. 

In conclusion, determination of hTERT mRNA expression 
by 1SH and hTERT protein expression by IHC can be used to 
obtain information contributing to a histopathologic^ diagnosis 
during screening of breast cancers. By use of a monoclonal 
antibody, we could very clearly and sensitively demonstrate 
hTERT protein expression in breast cancer tissues but not in 
non-cancerous tissues. We also demonstrated that 4 carcinomas 
with originally positive immunoreactiviiy against hTERT 
protein became negative after neo adjuvant chemotherapy. 
These results suggest that determination of hTERT protein 
by JCH can be used as a potential diagnostic and therapeutic 
maTker io distinguish breast cancers. 
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Expression of the ubiquitui-proteasoiuc pathway and muscle loss in 
experimental caucer cachexia. 

KUMi, Wyjce SM, Russcti ST. Hinc AV. Tisdale M J 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham, UK. 

Muscle protein degradation is thought to play a major role in muscle atrophy in cancer 
cachex.a. lo mveshgate the importance of the ubiquitin-proteasome pathway, which has 
been suggested to be the main degradative pathway mediating progressive protein loss in 
cachex.a, the express.on ofmRNA for proteasome subunits C2 and C5 as well as the 
ub.qu.tm-co.yugating enzyme, E2(I4k), has been determined in gastrocnemius and 
pectoral muscles of mice bearing the MAC 16 adenocarcinoma, using competitive 
i quantitat,ve reverse transcriptase polymerase chain reaction. Protein levels of proteasome 

r ( V deteimined ^ to ensure changes in m RNA 

r 1?. , m f g6S m Pr ° tein ex P rcssion - Mus <^ weights correlated linearly with 
wetght loss dunng the course of the study. There was a good correlation between . 
expressron of C2^ and E2(14k) rnRNA and protein levels in gastrocnemius muscle with 
mcrease, ( Of 6-8-fold for C2 and two-fold for E2(14k) between 12 and 20% weightVoss 
Z£S V ST* m CXpreSsion at wej g ht "»« of 25-27%, although loss of muscle 
J r m ' ^ ntraSt * eX P ression of 05 raR NA only increased two-fold and was 
^ 31 Wdght i° SSeS bGtWCen 7 5 - d 2™ Both proteasome functicS 
activity, and proteasome-specific tyrosine release as a measure of total protein 
degradahon was also maximal at 18-20% weight loss and decreased at higher weight loss 

P?° TrT Tp^ in IT™ 1 mUSde f ° Uowed a different P atter « ™* i~s in ' 
C2 and C5 and E2(14k) rnRNA only being seen at weight losses above 1 7% although 

muscle loss mcreased progressively with increasing weight loss. These results suggest 

that act, vation of the ubiquitin-proteasome pathway plays a major role in protein lL in 

gastrocnemius muscle, up to 20% weight loss, but that other factors such as depression in 

protem synthesis may play a more important role at higher weight loss. 

PMID: 16160695 [PubMed - in process] 
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Increased expression of proteasome subunits in skeletal muscle of cancer 
patients vvitk weight loss. 

Khal_J, Hinc AV . Fearon KC . Dcjong CH . Tisdale MJ . 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham B4 7ET, UK. 

Atrophy of skeletal muscle is common in patients With cancer and results in increased 
morbidity and mortality. In order to design effective therapy the mechanism by which 
this occurs needs to be elucidated. Most studies suggest that the ubiquitin-proteasome 
proteolytic pathway is most important in intracellular proteolysis, although there have 
been no reports on the activity of this pathway in patients with different extents of weight 
loss. In this report the expression of the ubiquitin-proteasome pathway in rectus 
abdominis muscle has been determined in cancer patients with weight loss of 0-34% 
using a competitive reverse transcriptase polymerase chain reaction to measure 
expression of mRNA $r proteasome subunits C2 and C5, while protein expression has 
been determined by western blotting. Overall, both C2 and C5 gene expression was 
increased by about three-fold in skeletal muscle of cachectic cancer patients (average 
weight-loss 1 4.5+/-2.5%), compared with that in patients without weight loss with or 
without cancer. The level of gene expression was dependent on the amount of weight 
T/oT^f 5 ^ 8 max,mal, y for both Proteasome subunits in patients with weight loss of 
12-iy /o. Further increases m weight loss reduced expression of mRNA for both 
proteasome subunits, although it was still elevated in comparison with patients with no 

r , n.? S tJ here WaS n ° evideoce for m increase ^ expression at weight losses less 
than 10 /o There was a good correlation between expression of proteasome 20Salpha 
subunits, detected by western blotting, and C2 and C5 mRNA, showing that increased 
gene expression resulted in increased protein synthesis/Expression of the ubiquitin 
conjugating enzyme, E2(14k), with weight loss followed a similar pattern to that of 
proteasome subunits. These results suggest variations in the expression of key 
components of the ubiquitin-proteasome pathway with weight loss of cancer patients and 
suggest that another mechanism of protein degradation must be operative for patients' 
with weight loss less than 10%. . 
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CeU type-specific occurrence of caveolin-lalpha and -lbcta in the lung 
caused by expression of distinct mRNAs. 

Kogo H, Aiba T. Fmimoto T . 

Department of Anatomy and Molecular Cell Biology, Nagoya University Graduate 
School of Medicine, Showa-ku, Nagoya 466-8550, Japan, hkogo@fujita-hu.ac.jp 

Two isoforms of caveolin-l,' alpha and beta, had been thought to be generated by 
alternate translation initiation of an mRNA (FL mRNA), but we showed previously that 
a variant mRNA (5'V mRNA) encodes the beta isoform specifically. In the present study 
we demonstrated strong correlation between the expression of the caveolin- 1 protein ' 
isoforms and mRNA variants in culture cells and the developing mouse lung The alpha 
isoform protein and FL mRNA were expressed constantly during the lung development 
whereas^expresaon of the beta isoform protein and 5'V mRNA was negligible in the fetal 
lung before 1 7.5 days post coitum, and markedly increased simultaneously at 1 8 5 days 
post coitum, when the alveolar type I cells started to differentiate. Immunohistochemical 
analysis revealed the cell type-specific expression of the two isoforms; the alveolar type: I 
cell expresses the beta isoform predominantly, while the endothelium harbors the alpha 
isoform chiefly. The mutually exclusive expression of caveolin-1 isoforms was verified 
byWestem blotting of the selective plasma membrane preparation obtained from the 
endothelial and alveolar epithelial cells. The present result indicates that the two 
caveolin-1 isoforms are generated from distinct mRNAs in vivo and that their production 
is regulated independently at the transcriptional level. The result also suggests that the 
alpha and beta isoforms of caveolin- 1 may have unique physiological functions. 

PMlD: 15067006 [PubMed - indexed for MEDLINE] 
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Oncogene and growth factor expression in ovarian cancer. 

KommossF, Baukncckt T, Birmclin G , Kolilei M . tescu Bf . PfleidererA . 

Department of Gynaecology, Albert^udwig University, Freiburg, Gennany. 

The varying tumor-biological behavior of ovarian carcinomas probably influences both 
Oar operand, y and response to chemotherapy, which are the most relevant progtosl 
factors. The phenotype of different ovarian carcinomas is obviously associated wTtnt 

TS^^mf^r^ signaI p ? way > indudmg c - myc ^ 

Analysts of EGF-R, TGF-alpha, c-myc. and c-jun expression in 33 stage UUIV and 2 
stage r/lfovanan carcinomas with biochemical, molecular^hemical and 

"of^^ S i° Wed ' C0ITdati0n betWeCn the and P rotei « 

levels of EGF-R and TGF-alpha for tumors with low or high expressing rates However 

Uie concentrate of measurable free EGF-Rs seems to depend on the L unt oCTGF ' 

Sh,r X ?r ,0n I ^ tUm ° ri - ThC EGF " R bindin « TGF-alpha is produced by 

ep.thehal tumor cells; stromal cells are usually TGF-alpha-negative, as shown by 
tmmunohtstochemistry. High expression rates of EGF-R. TGF-alpha and c-myc were 

S m W\T 10 0 h Ut ,° f , 35 , t OVarian CarCin ° maS ' res P ect -ely. C-jun mi^A wal 
tl^tZZl f S ^ ^ 0n - ma,i e nant originating from myometrium or 

ovary expressed no (or only small amounts of) EGF-R or TGF-alpha mRNA whereas a 
htgh c-myc expression was found in 1/7 normal myometria, and in 2/5 no^I TvaS. 
lnere was no strong correlation between EGF-R/TGF-alpha and c-myc/c-iun 
expresston.(ABSTRACT TRUNCATED AT 250 WORDS) 

PMDD: 1502888 [PubMed - indexed for MEDLINE] 
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Somatostatin receptors in primary human breast caaccr: quantitative 
analyse ot mRNA for subtypes 1-5 aud correlation with receptor protein 
expression and tumor pathology. 

Sal F ?\ D ^T ReS6arC ' 1 ' '- )e P artr " ent of Medicine, McGill University, 

SS« ^ Jt° SP u 68 n PmC AVCnUe West ' H3A 1 A1 Montrea I. Q^ec, Canada 
ujendrakumar@muhc.mcgill.ca 

^it, Statin K reCe f 0I " (SS T RS) haVe bCei1 identified m most hormone-producing tumors 
as well m breast cancer. In the present study, we determined SSTR1 -5 expression in 
primary ductal NOS breast tumors tlirough semi-quantitative RT-PCR and 
irruBunocytochemistry. The results from the analysis of 98 samples were correlated with 
several key lustological markers and receptor expression. All five SSTR subtypes are 

17t* y Q«o/ r ^?L a o 5^? NA1ivd in bfeaSt with 9l% of sample, stowing 

SSTR1 98% SSTR2, 96% SSTR3, 76% SSTR4, and 54% SSTRS. SSTR1-5 are 
locahzed to both tumor cells and the surrounding peritumoral regions as detected by 
immunocytochemistry. Levels of SSTR mRNA, when corrected for beta-actin levels 
were h.ghest for SSTR3 followed by SSTR1, SSTR2, SSTRS, and SSTR4. Furthermore 
W ^Jf™*™ mRNA P rotei " ^Pression with 84% for SSTRl', 

r^Jl SS', f° ^ SSTR3 ' 68% for SSTR4, 68% for SSTRS, and 78% for all five 
receptors SSTRl, 2 and 4 were correlated with ER levels whereas SSTR2 showed an 
■ additamal correlation with PR levels. These correlations were independent of patient age 
and histological grade. Moreover, using immunocytochemisriy, blood vessels exhibited 
receptor-specific localization for SSTR2 and SSTRS. Our results indicate significant 
correlahons between mRNA and protein expression along with receptor-specific 
correlations with histological markers as well as ER and PR levels. Differential 
distribution of SSTR subtypes in tumors and receptor-specific expression in vascular 
ag^nisT C0 " Sidered 38 a n0vel dia gn°sis for breast tumors with receptor subtype 
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A Iranscriptomic and protcomic analysis of- the effect or CpG-ODN on human 
THP-1 monocytic leukemia cells. 

KuoCC, KuoCW, LiaugiM, Lkng_SM. 

Institute of BioAgricultural Sciences, Academia Sinica, Taipei, Taiwan. 

The CpG motif of bacterial DNA (CpG-DNA) is a potent immunostimulating agent whose 
mechanism of action is not yet clear. Here, we used both DNA microarray and proteomic 
approaches to investigate the effects of oligodeoxynucleotides contauung the CpG motif (CpG- 
ODN) on gene transcription and protein expression profiles of CpG-ODN responsive THP-I cells 
:Microarray analysis revealed that 2 h stimulation with CpG-ODN up-regulated 50 genes and 
down-regulated five genes. These genes were identified as being associated with inflammation 
antimicrobial defense, transcriptional regulation, signal transduction, tumor progression, cell 
differentiation, proteolysis and metabolism. Longer stimulation (8 h) with CpG-ODN enhanced 
transcriptional expression of 58 genes. Among these 58 genes, none except one, namely WNTI 
inducible signaling pathway protein 2, was the same as those induced after 2 h stimulation 
Proteomic analysis by two-dimensional gel electrophoresis, followed by mass spectrometry 
identified several proteins up-regulated by CpG-ODN. These proteins included heat shock 
proteins, modulators of inflammation, metabolic proteins and energy pathway proteins 
Comparison of microarray and proteomic expression profiles showed poor correlation Use of 
more reliable and sensitive analyses, such as reverse transcriptase polymerase chain reaction 
Western blotting and functional assays, on several genes and proteins, nonetheless, confirmed that 
there is indeed good correlation between mRNA and protein expression after CpG-ODN 
treatment. This study also revealed that several anti-apoptotic and neuroprotective related 
proteins not previously reported, are activated by CpG-DNA. These findings have extended our 
knowledge on the activation of cells by CpG-DNA and maycontribute to further understanding of 
mechanisms that link innate immunity with acquired immune response(s). 

PMID: 15693060 [PubMed - indexed for MEDLINE] 
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1 Introduction 

Mammals protect Oremselves agains, pathogen infertion pn- 
manh. „„ mnale and adaptive immunity Ihe innate 
unmiHir s)Stern rel.es on a set of pat,e m recognition receptors 
(eg.. Joll-ldte .receptors) ,6 recognize foreign molecular struc- 
tures such as hpopolysaccliaride (LPS) and bacterial DNA |2 3] 
lnna,e .mmune cells recognize these molecular structures and 
.nrtra.e nol only i„„a,e bu, also adaptive immunity by produc- 
ing immunomodulatory cytokines and activating T and B 
»mm„„e cells |1J. Bacterial DNA can directly activate B cells to 
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proliferate and secrete immunoglobulins in a Tcell-independ- 
ent manna [4-6), It also induces B cells and monocytes to 
activate transcription factor NF-kB and secrete cytokines, 
including interleukin (11) 12, tumor necrosis factor a (TNF-a), 
and interferon tr/p p-10]. The immunostimulatory activity of 
bacterial DNA has been assigned tounmethylated CpG motifs 
(GACGTTfor murine, GTCGTTfor human) [il]. Recent evi- 
dence shows that synthetic oligodeoxynudeotides containing a 
CpG motif (CpG-ODN), like bacterial DNA with the CpG moi- 
ety (CpG- DNA), induce potent Thl-like immune responses 
that are protective against several infectious agents and 
immune disorders in animal models [1 2, 1 3). Biologically a ctive 
CpG-ODN, like bacteria DNA, activates ' macrophages and 
immature dendritic cells to increase expression of MHC class 
11 and costimulatory molecules, thereby transcribing cytokine 
mRNAs, and producing prainilarnmatory cytokines including 
TNFa, IL-1, IL-6 and IL-12 [9, 14-16]. CpG-ODN can therefore 
serve as an adjuvant and immunomodulator in vaccines 
against a wide variety of targets, including infectious agents, 
cancer a ntigens and all ergens (17}- 

It has been suggested that unmethylated CpG-DNA-medi- 
ated immune activation functions through a toll-like receptor 9 
(TLR9) signaling pathway |18J. Endocytosis and secruentially 
endosomal maturation as well as bin ding of heat shock protein 
(HSP) 90 to CpG-DNA are essential for induction of TLR9 sig- 
nal transduction |19, 20). It has also been shown that recogni- 
tion of CpG-DNA causes TLR to form a dimer, which recruits 
the adaptor molecule, myeloid differentiation factor 88 
(MyD88), through interactian between their C-terrninal ToH/lL- 
1R domains. This recruitment of MyD88 to the Toll/lL-lR do- 
main of TLR 9 initiates a signaling pathway that sequentially 
involves 11- 1 R-associated kinase 1 and TNF-a receptor-asso- 
ciated factor 6 [18, 21, 22). Studies using gene-deficient mice 
and RAW2647 cells transiently transfected with dominant- 
negative forms of these molecules have indicated that the 
MyD88-mediated signaling pathway is essential for CpG-DN A- 
induced activation of NF-kB and c-Jun NH 2 -terminal kinase 
(JNK), as well as subsequent production of cytokines in mono- 
cytic cells [18, 21 , 22]. The precise mechanism of action of CpG- 
DNA and CpG-ODN, nonetheless, is still not thoroughly 
understood- To further elucidate the molecular events after 
binding of CpG-ODN toTLR9, in this study, we treated CpG- 
ODN responsive THP-1 cells with CpG-ODN and evaluated 
changes by using DNA microarray and proteomic approaches. 
We have discovered up- regulation of more than 50 disting- 
uished genes/proteins and identified induction of several ant> 
apoptonc and neuroprotecting genes by CpG-ODN treatment. 

2 Materials and methods 
2.1 Reagents 

Phospborothioate- modified CpG-ODN and GpGODN were 
synthesized by MDBio {Taipei, Taiwan). Human specific 
ODN sequences are: CpG-ODN, 5'-TCG TCG TjTTGT CGT 
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TJT GTC GTT-3'; GpC-ODN, 5'TGC TGC TFT TGT GCT 
TXT GTG CTT-3\ The mouse specific CpG-OD N sequence is 
5*-TCC ATG ACG TTC CTG ATG CT -3' : CHCA was from 
Sigma (St Louis, MO, USA). 

Z2 CeRcurture 

Cell lines were obtained from the American Type Culture 
Collection: (Rockville, MD). Mouse RAW264.7 macrophage, 
and human embryonic kidney 293 cells (HEK293) were cul- 
tured in DM EM supplemented with 10% heat inactivated 
fetal bovine serum, 100 U/mL penicillin, 100 ug/inL strep- 
tomycin sulfate, 200mmol/L L-gJutamine, and 50 um p-mer- 
captoethanol in a humidified atmosphere of 5% C0 2 at 37°C 
The medium was changed every 2 days for all experiments. 
Human THP-1 monocytic leukemia cells, which have been 
shown .to express TLR9 and respond to CpG-DNA stimula- 
tion [23, 24), were cultured in RPMI1640 with the same sup- 
plements as for RAW2647 cell cultures. 

23 Human cDNA microarray 

Total RNAs extracted from cultured THP-1 ceDs were isolated 
with TRIzol (lnvitrogen, Leek, The Netherlands) :uid submitted 
to Genasia Biotechnology (Taipei. Taiwan) for further proces- 
sing. In brief, 4 ug of total RNA from CpG-ODN stimulated, or 
normal THP-1 ceDs was labeled with a fluorescence marker 
(U -vision, (Taipei, Taiwan)). Different colored fluorescence dyes 
(Cy5 and Cy3) were used to distinguish total RNA from normal 
and ODN stimulated cells. The labeled RNA was used for 
hybridization with the Human 1 cDNA microchi]> from Agilent 
Technologies (Palo Alto, CA, USA).The chips were scanned and 
the expression pattern was analyzed using genetdiip software. 
Genes showing up-regulation or down-regulation of RNA levels 
were analyzed and identified on a genomic database as sug- 
gested by the manufacturer of the microchip. 

2.4 Protein preparation 

THP-1 cells were seeded in a 175 cm 2 tissue culture flask at a 
density of 10 6 cells per milliliter in culture medium. The cells 
were stimulated with or without 1.5 UM CpG-ODN at defined 
times and harvested by centrifugation at A°C, 1000 x g for 
I5min. Cell pellets were washed twice with ice-cold PBS, 
resuspended and sonicated in extraction buffer containing 
25 mM Tris HQ ( P H 75), 2mw p-mercaptoeth;mol and pro- 
lease inhibitor cocktail. After centrifugation al 10000 x g for 
20 m in, ammonium sulfate was added to tire supernatant until 
the final concentration readied 50% saturation w/v. Hie solu- 
tion was stirred at 4°C for 30min and centrifuged at 10 000 x g 
for 30min af4°C The supernatant fraction -was then trans- 
ferred into a fresh tube, and the precipitated protein pelJet 
soluhilized in extraction buffer. To remove salts and other con- 
taminants, the extracts were treated with a pre-ccoJed (- 20X) 
solution of 10% TCA in acetone with 0 07% p-meicaptoetha- 
no). Proteins were allowed to precipitate overnight at -20 e C 
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After centrifugation, the pellet was washed with ice<old ace- 
tone, containing 0.07% f^merxaptoethanoL The supernatant 
was discarded and the pellet dried in a SpeedVac system (Model 
AES1 01 0; Savant, Holbrook, NY, USA). 

2.5 2-DE 

2-DE was performed using an IPGphore 1EF and a Hofer 
DALT vertical unit (Amersham Biosciences; Piscatawa^ NJ, 
USA). One milligram of dried protein sample was dissolved 
in 350 uL of rehydration buffer solution, containing 7 M urea, 
2 M thiourea, 4% w/v CHAPS, 5 mM tributyl phosphine, and 
2% IPG and loaded onto an immobilized pH 3-10 linear 
gradient strip (18 cm), followed by rehydration for 16h. 1EF 
was then performed in the following manner. 100 V for 
30 min, 250 V for 30min, 500 V for 30min, 1 000 V for 
30 min, 4000 V for 30 mm, 6000 V for 55 000 Vh. At the end 
of 1EF, the IPG strips were equilibrated for 15 min in buffer 
containing 6 m urea, 2% w/v SPS, 30%v/v glycerol, and 
50 mM Tris, pH 6.8, then reduced with 65 mM dithiorery- 
thritol (DTE) and subsequently alkylated with 135mM 
iodoacetamide for another 15 min. After equilibration, the 
IPG strips were immediately placed on top of a 12% SDS- 
PAGE (1.5mm, 20 x24cm). The second dimension gels 
were then overlaid with molten 0.8% agarose solution in 
SDS electrophoresis buffer. Electrophoresis was performed 
at 16°C, starting at 10mA per gel for 1 h, followed by 45 mA 
per gel until the dye front reached the bottom of the gels. 

2.6 Staining and image acquisition 

Immediately after electrophoresis, gels were stained with 
SYPRO Ruby (Molecular Probes, Eugene, OR, USA). Jn 
brief, gels were fixed for 30 min in 10% methanol, 7% acetic 
acid, and then stained overnight in SYPRO Ruby stain. The 
staining solution was removed and gels were washed in 10% 
methanol and 7% acetic acid for 3 h. After staining, image 
acquisition was carried out on a Typhoon 9200 (Amersham 
Biosciences). To identify a protein, spot detection, quantifi- 
cation and matching of 2-D results were analyzed using 
ImageMaster software (Amersham Biosciences). The M, of 
the proteins were calibrated according to the LMW-SDS 
Marker Kit (Amersham Biosciences), and their p/ values 
were estimated from the position of the protein spots on the 
2-D gel 3nd confirmed with the information supplied by the 
manufacturer. Since most of the p/ values for the truncated 
proteins had not been reported previously, the p/ values of 
the truncated proteins were estimated from the position of 
the observed spots. To omit the variation due to the use of 
separate gels, after background subtraction, the intensity 
levels of protein spots on each gel were normalized as a pro- 
portion of one reference spot, and protein quantities were 
calculated by integrating the density over the spot area. Pro- 
tein spots that showed reproducible modulation exceeding 
-80% after CpG-ODN treatment in three experiments were 
further analyzed by MS. 
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2.7 ln-gei digestion with trypsin arid extraction of 
peptides 

Protein spots were excised from stained gels and cut into 
pieces. In brief, gel spots were dehydrated with ACN for 
10 min and dried in a vacuum centrifuge. Gel pieces were 
reswelled with 55 mw DTE in 25 mM ammonium bicarbon- 
ate (pH 8.5) at 37°C for 1 h. The solution was then exchanged 
with alleviation solution, which contained lOOrriM iodoaceta- 
mide in 25mw ammonium bicarbonate (pH 8.5), at room 
temperature for lh. After- alleviation, the gel pieces were 
washed twice with 50% ACN in 25 mM ammonium bicarbo- 
nate (pH 8.5) for 15 min. The wash solution was discarded 
and the pieces of gel were dehydrated with ACN for lOmin 
and dried in a vacuum centrifuge. Tryptic digestion was 
initiated by reswelhng the gel in 25 mw ammonium bicar- 
bonate solution with 25 ng of trypsin- (Promega, Madison, 
WI, USA). After incubation at 37°C for 16h, tryptic peptides 
were extracted twice with 50% ACN containing 5% formic 
acid for 15 min with moderate sonication. The extracted 
solutions were pooled and evaporated to dryness in a vacuum 
centrifuge. The dried peptide mixture was dissolved in 0.1% 
formic acid and used for MS. 

2.8 MALDI-Q-TOF MS and protein identification 

Tryptic peptides analyses were performed using a Micromass 
Q-TOF Ultima MALDI (Micromass, Wythenshawe, U.K.) 
equipped with a 337 nm nitrogen laser and operated in 
reflection positive ion mode. Peptide mixtures (1 uL) were 
premixed with 1 pi of the matrix (5mg CHCA in 50% ACN 
with 0.1% TFA) then spotted onto the MALDI target plate. 
Mass spectra were acquired for the mass range of 900- 
3500 Da and the individual spectra from MALDI MS or MS/ 
MS were processed using the Micromass MassLynx 4.0 soft- 
ware. The generated peaic list files were used to query the 
Swiss-Prot database using the MASCOT program (http:// 
wvv^.matrixsdence.com) with the following parameters: 
peptide mass tolerance, 50ppm; MS/MS ion mass tolerance, 
0.25 Da; allowance of missed cleavage, 1; and consideration 
for variable modifications such as oxidation of methionine 
and carboxyamidomethylation of cysteines. Only significant 
hits as defined by MASCOT probability analysis were con- 
sidered initially. In addition, when the PMF matches were 
between 5 and 9, at least one peptide sequence was manually 
checked by MALDI MS/MS analysis. 

2.9 RT PCR analysis 

cDNA from THP- 1 cells was produced with Superscript II 
reverse transcriptase (Invitjogen) using a oligo(dT), 5 primer 
for 1 h at 42°C. PCR of cDNA was performed using specific 
primers for the gene of interest and control p-actin. All PCP 
products were elecrrophoiesed on a 1.5% agarose gel, and 
DNA bands were visualized by staining the gel with ethi- 
dium biornide. 
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2.10 Imrnunoblotting 



3 Results 



Human THP-1 or mouse macrophage RAW264.7 cells 
(5 x 3 x l^/weB) were cultured in a six-well culture 
plate and treated with or without 1.5 pM CpG-ODN for 
the. designated times. After stimulation, cells were harvest- 
ed by cejitrifugation at 3000 x g for 15miri in a refri- 
gerated centrifuge and washed twice with cold PBS buffer 
The cells were fysed on ice for 15min with 300 uL lysis 
buffer (Pierce, Rockford, USA), supplemented with pro- 
tease inhibitor cocktail (Sigma). The lysates were cen- 
trifuged at 12000 x g for ISmin at 4°C, and protein 
concentrarions of supernatant were determined using the 
Bio-Rad Protein Assay (Hercules, CA> USA). The lysates 
(50 |ig of protein/lane) were subjected to 12% SDS-PAGE 
and transferred to NC membranes (Amersham Bio- 
sciences). The membranes were blocked in PBS-0.1% 
Tween 20 (PBST) containing 5% non-fat skim milk at 
room temperature for 1 h, followed by staining with anri- 
ADP-ribosylation factor 3 (ARF-3) monoclonal antibody. 
(0.1 ug/mL; Sigma). The membranes were then incubated 
with horseradish peroxidase-conjugated secondary anti- 
body (dilution, 1:3000) for lh. After washing three times 
with PBST, specific bands were detected by chemilumi- 
nescence according to the manufacturer's protocol 
(Amersham Biosciences). 

2.11 Cell transection and tucif erase assay 

HEK293 cells (5 x 3 x lO^/well) were transfected using 
FuGENE6 (Roche Molecular Biochemicals, Indianapolis, 
IN, USA) plus 0.1 pgp5xNF-KB-juc (Stratagene, La Jolla, CA, 
USA), 0.05 ng pCDNA3.1-P-gaiactosidase, and pCDNA3.1- 
hTLR9 overnight The cells were incubated with or without 
1.5 um CpG-ODN for 8h and then lysed. NF-kB lucifexase 
activity assays were performed as recommended by the 
manufacturer (Promega). 0-galactosidase activity was used to 
normalize the data. 

2.12 Enzyme activity assay 

Pyruvate kinase activity was assayed in a solution (ltnl) 
containing lOOmM Tris-HCl (pH 8.0), lOOmM KC1, 10mM 
MgCl 2 , 0.2 mM NADH, lOmw PEP, K5 idm ADP. lunit of 
lactate dehydrogenase, and an appropriate amount of cell 
lysate from CpG-ODN untreated or treated THP-1 cells. The 
reaction was monitored at 30°C for a period of time by 
measuring the decrease in absoibance at 340 nm. PGK ac- 
tivity was assayed in a coupled reaction wit)) glycol aldehyde 
3-phosphate dehydrogenase (GAPDH) as described by Lee 
[25], In brief, the assay was performed at 30°C in a total vol- 
ume of ] ml containing lOOmM Tris-HCl (pH 7.9), IOjjim 
MgCl 2 . 0.15 mM NADH, 2mM ATP, 6mM 3-phosphoglyce- 
rate, 0.1 mg/inL BSA, 50 mg of GAPDH, 3nd an appropriate 
amount of cell lysate. NADH consumption was monitored at 
340 nm. 



3.1 Effect of CpG-ODN on gene expression profiles of 
human THP-1 cells 

To elucidate the effect of CpG-ODN on gene expression, 
THP-1, a cell line known to express TLR9 and respond to 
CpG DNA (23, 24) was cultured with or without CpG- 
ODN. Since preliminary experiments showed that 15 pM 
CpG-ODN caused more contrasting results between nor- 
mal and CpG-ODN. treated cells, .1.5 um CpG-ODN was 
used throughout the experiments. To distinguish CpG- 
ODN treated samples from the control, total cellular RNA 
of normal and CpG-ODN treated cells was isolated and 
labeled with the fluorescence dyes, Cy5 and Cy3, respec- 
tively. The labeled RNA was then used for hybridization 
with a Human 1 cDNA microchip from Agilent Technol- 
ogies. Of the 13 000 human genes represented on the 
gene array, a total of .55 genes changed expression sign- 
ificantly after 2 h CpG-ODN treatment Among these, 
50 genes were up-regulated while five genes were down- 
regulated by a factor > 2. These genes were sorted by 
functions and are listed in Tables 1 and 2. They included 
notably, 1L-18 receptor accessory protein, MSGA beta 
gene, thioredoxin, pro-pol-dUTPSse polyprotein, Spl40, 
connexin 59 gene, Grb2-13ce 2, en oyl- coenzyme A hydra- 
tase, propionyl coenzyme A carboxylase, cytochrome P450, 
and WNTl inducible signaling pathway protein 2 (W1SP-. 
2) etc. The function of these genes are known to be relat- 
ed to inflammatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduc- 
tion, tumor progression, cell differentiation, proteolysis 
etc. 

Table 1. Genes up-regulated |> 2 fold) in human THP-1 cells after 
2h of CpG-ODN treatment 



Gene name and description 
{changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-18 receptor accessory protein AF077346 

T cell receptor V beta gene X58806 

MGSA, beta gene U03019 

Antigen gene (PA) M21896 

TIED NM_004791 

Platelet activating receptor AF002986 

Antimicrobial defence 

Thioredoxin NM_003329 

Pro-Pol-dUTPase polyprotein AC004748 

Nuclear body protein Sp140 U63420 

Transcriptional regulation 

Putative transcription factor LUZP AI986271 
General transcription factor II, I, pseudogene 1 AI7°0706 
Connexin 59 gene 

L29277 

Basic helix-loop-helix protein class B 1 AF221520 
(BHLHB1) 
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Table 1. Continued 



Gene name and description 
(changed fold > 2) 



Table 2. Genes downr regulated {> 2 fold) in human THP-1 cells 
after 2 h of CpG-ODN treatment 



Genebank 
number 



Regulatory protein 

Advillin - AF041449 

i 

Channel and transport . 

Small GTP binding protein Rab9 U44103 
FXYD domain-containing ion transport AI125364 
regulator 1 

Signal transduction 

SH3-d6main Grb2-like 2 AFQ36268 
Titin X90568 
Titin associated protein {1 65 kD protein) X69089 
KIAA1451 protein AB040884 
Vasoactive intestinal peptide receptor Ul 1087 

Enzyme and protease 

E.noyF Coenzyme A, hydra tase AI800553 
Nephrin, B-type metallocarboxypeptidase U65090 
Propionyl Coenzyme A carboxylase AB011145 
Acyloxyacyl hydrolase (neutrophil) M62840 
Cytochrome P450 U79716 
Intestinal alkaline phosphatase M31008 

Tumor progression and cell differentiation 
. Retinoblastoma 1 L11910 
Human genomic DNA of 9q32 anti-oncogene AB036268 

of flat Epitherium cancer, 

segment 6/10 

WNT1 inducible signaling pathway protein 2 AF 100780 
(WISP-2) 

Structure protein 

Collagen, type IV, alpha 6 D21337 

Beta Myosin heavy chain M58018 

Other 

Homo sapiens Cri-du-chat region mRNA, U52827 
clone NIBB11 

Human mRNA for laminin alpha 5 chain, AB010099 
partial cds. 

NIK like and Thyroxin-binding 283850 

globulin precursor 

Hypothetical protein DKFZp434M0331 AL 1377 20 
Hypothetical protein FLJ11021 similar j AK023985 

to splicing factor 

Hypothetical protein AL049851 

Chromosome 18 open reading frame 1 NM_004338 

Arfaptin 1 AW408785 

Zinc Finger protein 8 IZFP8) M29581 

Zinc finger protein 137 {clone pHZ-30) U09414 

Ollactomedin related FR localized protein AI7384G8 

Cyclin-dependent kinase 8 BE467537 

Integrin, alpha 1 D87462 

KIAA0421 protein AB007881 

KIAA1233 protein AB033059 

Unnamed protein product AK026362 

mi4 X75535 

BC33 1191^1 AAD39268 



Gene name and description 
(changed fold> 2) 



Genebank 
number 



Zinc-finger horneodomoin protein 4 . . BAB03600 

Human protein kinase MEKK2b mRNA, AF239798 

complete cds. . 

GlypicariS U66033 
Human genomic DNA, chromosome 22q11.2, AP000362 

clone N75A12. 

Collagen, type I, alpha 1 274615 



Longer stimulation of THP-1 cells with CpG-ODN (8h) 
resulted in the up-regulation of 58 genes. These genes in- 
cluded notably IL-10 receptor beta, formyl peptide receptor- 
like 1 (FPRLl), vitamin D receptor, nuclear receptor sub- 
family 1 (LxR), early B-cell factor, protein kinase C gamma 
(PKCJ, Nek, Ash, phospholipase C binding protein (NAP4), 
phosphoriboxyl pyrophosphate armdotransferase, dishev- 
eled 3, WISP-2 etc. Analysis of the functions of the 58 up- 
regulated genes showed that they are associated with anti- 
inflammation* transcriptional regulation, intracellular signal 
transduction, tumor progression, cell differentiation, pro- 
teolysis, neurodegeneration, neuroprotection etc. (Table 3). 
AVe also found that the stimulation of THP-1 cells with CpG- 
ODN for different periods of times resulted in different pro- 
files. Several defense related genes such as IL-18 receptor 
accessory protein, Pro-Pol-dUTPase polyprotein, Spl40 and 
connedn 59 were transiently up-regulated at 2 h short stim- 

Table 3. Genes up-regulated (> 2 fold) in human THP-1 cells after 
8 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-10 receptor {beta) - U08988 

Formyl peptide receptor-like 1 tFPRLI) AF081535 

Vitamin D receptor J03258 

NMDAR1 Z32774 

CD44 antigen AW028346 

Nuclear receptor subfamily 1 (LxR) NM_005693 

Neuromedin B receptor M73482 

Transcriptional regulation 

Early B-cell factor (ebf) AF208502 

Neurogenic differentiation 1 (Neuro D) AB018693 

MAX dimerizotion protein tNESH protein) AB037886 

Ribosomal protein S6 kinase AF090421 

ASH2L AB022785 

Regulatory protein 

Hypothetical protein DKFZp434H0820 AL137555 

Peroxisomal farnesylated protein X75535 

LT BP4 AF051344 

Neuronal penti axin II U29195 
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Table 3. Continued 



Gene name and description 
(changed fold > 2) 



Structure protein 

. Tricbohyalin 
Kertain 

Other 

Human transferrin pseudogene 
TIMP-2 

Collagen-like protein 

Human genomic DNA, chromosome 21q, 

section 60/105 
Human genomic DNA, chromosome 21q, 

section 64/105 
KIAA0136 
KIAA0379 
KIAA0489 
KIAA1114 
KIAA1451 
K1AA0756 

Zinc finger protein 267 

Hypothetical protein FU 10633 

Hypothetical protein EUROIMAGE 1955967 

Mybl homolog like 1 

Antizyme inhibitor 

Disintegrinlike and metalloprotease 

treprolysin type) with Thtombospondin 

type 1 motif, 3 
ADP-ribosylation factor 3 (ARF-3J 
Testis specific protein, Y-linked 
Unnamed protein 



Genebank 
number 



iBMRr^jcj^ngs; 899 

Table 4. List of antimicrobial and anti- inflammatory genes 
modulated by CpG-ODN treatment of THP-1 cells 



Channel and transport 
Gamma- aminobutyric 161] A receptor 
ATP synthase subunit F6 ' 
transient receptor potential channel 1 
Choroidererma (Rab escort protein 1) 

Signal transduction 
Protein kinase C, gamma 
Regulator of G-protein signaling 5 
Nek, Ash and phosphotipase C binding . 

protein INAP4) 
Highly similar to adeylate kinase gene 

Enzyme and protease 
Phenylalanine hydroxylase 
Carboxypeptidase AT 
Xyl urokinase 
Pancreatic lipase 
Ubiquitin specific protease 12 
Transmembrane protease, serine2 
Aspartate bat a- hydroxylase 
Phosphoribosyl pyrophosphate 
amidotransferase 

Tumor progression and celt differentiation 

CDC23 
WISP-2 

Microserrrinoprotein, beta 
Dishevelled 3 



NM_004961 
M37104 
Z73903 
X57637 

Z15114 

AI674877 

AB005216 

AB016886 

AA203389 

X67318 

AK001205 

J05125 

AF022789 

U75329 

U03109 

D13757 



AF053977 
AF100780 
M34376 
NM_004423 

L09190 
AF061809 

M22376 
U44383 
U67921 
AP001716 

AP0O1720 

D50926 

AB002377 

AB007958 

AL049732 

AB040884 

AB0 18299 

AF220492 

AK001495 

AK026108 

AK001893 

D88674 

AB002364 



M74491 

M98525 
AK026042 



Gene name * 


fipnphAnk 


Expression fold 




number 










2 h 


o n 


Connexin 59 gene 


L29277 


2.12 ±0.05 


1.74 ±0.08 


IL-18 receptor 


X58806 


2.32 ± 0.21 


1.19 ±0.13 


accessory protein 








Integrin, alpha 1 


X68742 


2.01 ± 0.03 


1.05 ±0.10 


Nuclear body protein 


U63420 


2.22 ±0.11 


1.39 ±0.04 


Sp140 








Pro-PoWUTPase" 


AC004748 


2.33 ± 0.18 


0.95 ±0.03 


polyprotein 








Thioredoxin 


NM_003329 


2.20 ±0.08 


1.07 ± 0.01 


FPRL1 


AF081535 


0.95 ± 0.06 


2.13 ±0.11 


1L-10 receptor 


U08988 


1^2 ± 0.18 


2.21 ± 0.07 


LxR 


NM_005693 


0.90 ± 0.03 


2.37 ±0.31 


Vitamin D receptor 


J03258 - 


1.37 ± 0.23 


2.39 ±0.11 



Expression fold is designated as the ratio of CpG ODN treated 
over control 



illation but weie down-regulated thereafter, while anti- 
inflammatory associated genes such as FPRL1, 1L10 recep- 
tor, vitamin D receptor and LxR were up-regulated after 8h 
stimulation (Tables 1, 3 and 4). 

3.2 Verification of the microarr ay results with RT-PCR 
or Western blott in g 

To verify the results from the microarr ay analysis,, we also 
performed RT-PCR on the up-regulated genes (l-ig. 1). Con- 
sistent with results obtained in the microarr ay gene expres- 
sion analysis, RT-PCR studies showed that the rnRNA levels 
of some selected genes, including ubiquitin specific pro- 
tease 12, regulator of G-protein signaling 5, NAP4 and 
ASH2L, were increased in response to CpG-ODN (Fig. 1, 
Table 3). In addition, the protein expression level of ARF-3 

■ - GpC-ODN CpG-ODN 



Ubiqmlin specific 
protease 12 
Regulator of G- 
prot»in signaling 5 } 

NAP4 



t n ■ ^i i jn, tf 1 n i ^uw ii y i ^ 

Figure 1. Induction of various genes by CpG-ODN. THP1 cells 
were stimulated with medium alone, 1.5 pM GpC-ODN (as the 
negative contiot) or CpG-ODN for the indicated times.- RT-PCR 
was then performed to analyze gene expression levels, p-actin 
was used as an internal control. The experiment was repeated 
three times with similar results. 
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Actin 



(B) RAW 



ARF-3 ^^^0^^^ 
Actin 

Figure 2. CpG-ODN induced ARF-3 protein expression in human 
THP-1 or mouse RAW264.7 cells. THP-1 (A) or mouse RAW264.7 
(B) cells were incubated whh 1.5 um CpG-ODN for the indicated 
time points. The protein expression level of ARF-3 was deter- 
mined by Western blotting of cell extracts using anti-ARF-3 anti- 
body. The experiment was repeated three times with similar 
results. 

was shown to increase in Western Hotting analysis in cell 
lysates from THP-1 ceUs treated with CpG-ODN for 8-24 h 
(Fig. 2A). Similar studies showed that the ARF-3 protein was 
aJso induced by mouse specific CpG-ODN in other TLR9 
expression cell lines such as the mouse macrophage 
RAW2647 cell line (Fig 2B). 

3.3 Proteins regulated in CpG-ODN stimulated THP-1 
cells 

To further assess whether there was any correlation between 
regulation of gene expression and expression of cellular pro- 
teins, a proteomic approach was adopted to identify protein 
expression profiles. THP-1 cells were treated with CpG-ODN 
for defined times (from 8 to 40h), and their cytoplasmic 
proteins were extracted for 2-DE analysis. Although the use 
of high concentrations of urea might give us a broader view 
of aD the proteins aiTected by CpG-ODN r preliminary results 
from 2-D gels showed that the resolution of the protein mix- 
tures were not satisfactory. To improve and get the best reso- 
lution from 2-DE, total proteins were roughly separated into 
supernatant and precipitated fractions using 50% saturated 
ammonium sulfate solution. To remove salts and other con- 
taminants, both protein fractions were precipitated with TCA 
solution and then subjected to 2-DE. By protein spot deter- 
mination analysis, about 500 and 450 well-resolved spots 
were observed on each pH 3.0-10.0 gel for precipitated or 
supernatant fractions, respectively. Comparative analysis of 
2-DE between treatments and control showed that the inten- 
sities of the protein spots from the ammonium sulfate pie 
opitated fraction did not change, while several protein spots 
weie up-regulated by at least -80% in the supernatant frac- 
tion of 8h CpG-ODN stimulated THP-1 cells (Fig. 3). 

The protein spots were individually excised from gels for 
further identification. After trypsin digestion, several protein 
spots were identified without ambiguity by MS MALD1- 




Untreated 




Figure 3. The effect of CpG-ODN on the 2-DE profile of THP-1 
cells. THP-1 cells were treated with or without 1.5 pM, CpG-ODN 
for 8h. Total proteins were extracted and roughly separated into 
two fractions by 50% saturation with ammonium sulfate. The 
supernatant fraction was then separated by 2-DE. Protein spots 
were visualized by SYPRO Ruby staining. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. Protein spots were 
identified by trypsin digestion and MS. Localization of protein 
spots 1 tenoly^coehzyme A hydratase), 2 (proteasome a) and 3 
tcyctophilin A; two isoforrns) are shown. The experiment was 
repeated three times with similar results. 



TOF. These proteins included HS P60, HSP90, cydophi- 
lin A, enoyl-coenzyme A hydratase, eukaryotic translation 
elongation factor, proteasome or and p chain and ATP syn- 
thase beta chain {Table 5). Similar experiments on cells 
treated for a longer period of time with CpG-ODN stimu- 
lation (25 h) revealed that 27 protein spots were changed in 
intensity by at least -80%. These protein spots contained 
members of HSPs (HSP27, hsc70, grp78 and grp94), 
metabolic enzymes (phosphoglycerate kinase (PGK) and 
pyruvate kinase (PYK)), macrophage capping protein and 
cyclophilin A (Table 6). Among these proteins, macrophage 
capping protein, PGK, PYK, cyclophilin A and HSP27 
(Figs. 4 and 5A) .were found to be up regulated. Interest- 
ingly, we found that a truncated form of grp78 with an' 
expected mass of 25kDa and pf of 5.3 was up-regulated 
while grp78 itself was down- regulated. A similar situation 
was also found for grp94 and hsc70 and their truncated 
derivatives (Table 7 and fig. 5). In addition, we also ob- 
served six down-regulated protein spots on 2-D gels in 
samples after 25 h CpG-ODN treatment. Among these six 
proteins, we have successfully identified three as 40s ribo- 
somal protein SA, grp78 and Iisc70, respectively (Table 6), 
while the other diree, due to their relative low abundance, 
have not been identified yet. 



© 200b WILEY- VCH Veitag GmbH & Co. KGaA, Weinheii 



www. pi otenmics-iournal.de 



proteomics 2005, 5, 894-906 

Table 5. List of proteins modulated by 8h CpG-ODN treatment 



901 



Protein name 


Accession 




P/ 


Matched 


Cover- 


Score 


Expression 




no. 


(theor.) 


(theor.) 


no. 


age 




fold 


ATP synthase beta- chain 


gil 14549 


56 525 


5.26 


16 


58 


171 


2.33 ± 0.06 


CydophilinA 


P05092 


17 870 


7.82 


5 


35 , 


62 


2.85 ± 0.13 


Enoyl-Coenzyme A 


gi4503447 s * 1 35 971 


6.61 


11 


44 


62 


2.52 ±0.05 


hydratase 
















Eukaryotrc translation 


gi4503481 


50 087 


6.25 


6 


25 


68 


3.41 ± 0.21: 


elongation factor 
















HSP60 




57 963 


5.24 


13 


27 


76 


2.78 ± 0.03 


HSP90-beta 


P08238 


83 133 


4.97 


10 


18 


65 * 


2.36 ±0.10 


Proteasome a chain 


gi4506181 


25 882 


6.92 


11 


59 


80 


2.52 ±0.11 


Proteasome p chain 


g>4506193 


26472 


8,27 


9 


46 


84 


3^5 ±0.17 



Expression fold is designated as the ratio of CpG-ODN treated over control 

a) T NCBI accession number 

b) Swiss-Prot accession number 



Table 6. List of proteins modulated by 16 and 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot M, 


p/ 


Match- 


Cover- 


Score 


Expression fold 




no: 


(theor.) 


(theor.) 


ed no. 


age% 




16h 


25h 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 


62 


2.53 ± 0.02 


2.48 + 0.15 


78 kDa glucose regulat- 


P11021 


72 288 


5.07 


13 


30 


148 


0.61 ± 0.04 


0.29 ± 0.06 


ed protein (grp78) 


















HSP27 


P04792 


22 768 


5.98 


12 


59 


124 


1.00 ± 0.01 


2.61 + 0.12 


Heat shock cognate 


PI 1142 


70 854. 


5.37 


16 


34 


114 


0,64 ± 0.01 


0.31 ± 0.03 


70 kDa protein 


















(hsc70> 


















Macrophage capping 


P40121 


38 494 


5.88 


9 


30 


58 


1.31 ±0.01 


2.58 ± 0.03 


protein 


















Phosphoglycerate 


P00558 


44 284 


7052 


11 


33 


71 


2.32 ± 0.11 


4.23 ±0.19 


kinase 


















Pyruvate kinase 


P14618 


57 710 


7.95 


17 


32 


114 


1.65 ± 0.07 


2.70 + 0.12 


40s ribosomal protein 


P08865 


32 833 


4.79 


5 


23 


61 


0.33 ± 0.02 


0.35 ± 0.05 


SA (RSP40) 



















Table 7. List of truncated proteins detected in THP-1 cells after 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot 


M, 


Pt 


Matched 


Cover- 


Score 


Expression 




no. 


(obs.) 


(obs.) 


no. 


age% 




fold 


94 kDa glucose- regulated 


P14625 


; -59 700 


-5.00 


12 


14 


104 


New** 


protein tgrp94) 
















Truncated form of grp78 


P11021 


-25000 


-5.30 


12 


22 


. 75 


New 


Truncated form of hsc70 


P11142 


-22 000 


-5.80 


12 


20 


96 


(Mew 


Truncated form of hsc70 


P11142 


-19000 


-6.10 


11 


18 


113 


New 



d) New designated proteins detected in the CpG-ODN treated gel but not in the corresponding control gel 
3.4 Comparison of microarray and proteomic results 



Table 8 shows the expression of six genes and their corre- 
sponding proteins that were modulated by 8h treatment of 
THP-1 cells with CpG-ODN. Besides enoyl-coenzyme A 
hydratase, there was poor correlation between the expression 



of genes and their corresponding proteins (Table 8), sug- 
gesting that more in-depth studies were needed. To further 
evaluate whether changes observed in protein expression 
correlated with changes in mRNA levels, we randomly chose 
two proteins (PGK and PYK) that were induced after 16b 
CpG-ODN treatment and determined their mRNA levels by 
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Figure 4. 2-0 gel electrophoretic analysis of CpG-ODN-treated 
THP-1 cells. (A) Total cell protein from unstimulated THP-1 cells 
was subjected to 2-DE JB) THP-1 cells were treated whh or with- 
out 1.5 mm CpG-ODN for defined times. Cellular proteins were 
extracted and separated by 2-DE. Several up-regulated proteins 
are shown in the SYPRO Ruby staining gel. Comparison of CpG- 
ODN treated. THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. The experiment was 
repeated three times with similar results. 




Figure 5. ?-D gel electrophoretic analysis of CpG ODN-treated 
THP-1 cells. 'THP-1 cells were treated with or without 1.5 CpG 
ODN for defined times. Cellular proteins were extracted and 
separated by 2-DE. Protein spots were delected by SYPRO Ruby 
staining. (A> Expression of HSP27 was induced by increasing the 
period of CpG-ODN stimulation. A truncated form of hsc70 was 
detected on the gel. (B) The native form of grp78 was detected in 
untrcaied cells, while the truncated form of grp78 was observed 
after 25 h stimulation. The experiment was repeated three times 
with similar results. 
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Table 8. Cqmpar ison of gene and protein expression levels in 
THP-1 cells after 8 h CpG-ODN treatment 



Protein name 


Gene 


Protein 




expression 


expression 




fold from 


fold from 




microarray 


2-Dgel 


Enoyr-Coenzyme A hydrolase 


1.72 ± 0.31 


2.52 ± 0.05 


Eukaryotic translation 


0.95 ± 0.01 


3141 ±0.21 


elongation factor 






HSP60 


0.94 ± 0.07 


2.78 ±0.03 


HSP90-beta 


1^58 ±0.11 


2.36 ± 0.10 


Proteasome ct chain 


0.98 ± 0.03 


2.52 ±0.U 


Proteasome 0 chain 


1.09 ±0.11 


3.85 ± 0.17 



RT-PCR. Our results showed that inRNA levels of PYK 
increased after 16 h CjiG-ODN treatment, while mRNA 
levels of PGK were dramatically increased after 24 h stimu- 
lation (Fig. 6). In addition, we also performed enzyme activ- 
ity analysis and found that the activity of PGK and PYK were 



3.5 




0 2 4 8 16 24 

Time (b) 



Figure 6. Activities of PYK and PGK induced by CpG-ODN. THP1 
cells were stimulated whh medium alone, 1.5 pM GpC ODN tas 
the negative control) or CpG-ODN for the indicated times. Cell 
lysales were extracted and nssDyed tor (A) PYK and (0) PGK ac- 
tivities. Data represent mean i SEM. \n= 3). 
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Figure 7. Induction of PYK and PGK transcripts by CpG-ODN. 
THP1 cells were stimulated with medium alone, 1.5 GpC-ODN 
or CpG-ODN for the indicated times. RT-PCR was then performed 
to analyze gene expression, pectin was used as an internal con- 
trol. The experiment was repeated three times with similar 
results. 



indeed increased by a factor -2.5 after CpG-ODN stimula- 
tion (Kg. 7). To confirm that mRNA induced by CpG-ODN 
would also be accompanied by an inoease in protein 
expression even though it was not detected in 2-D gel analy: 
sis, we used more sensitive and specific Western blotting 
analysis. As shown in Table 3 and Fig. 2, ARF-3 was identi- 
fied in the microanray gene profile but not in the proteomic 
expression profile. Nevertheless, we observed enhanced pro- 
tein expression of ARF-3 after CpG-ODN stimulation by 
Western blotting. Moreover, to investigate whether the up- 
regulation of ARF-3 by CpG-ODN is mediated through the 
TLR9 pathway, TLR9-deficient HEK293 cells were transiently 
cotransfected with hTLR9 and luriferase-reporter gene driv- 
en by a NF-xB-dependent promoter. Our data showed that 
NF-kB activity of untransfected HEK293 cells were not 
responsive to CpG-ODN stimulation, while in transfected 
HEK293 cells expressing hTLR9, NF-kB luciferase activity 
was up-regulated 12-fold after 8h CpG-ODN stimulation. 
The activation of NF-kB induced by CpG-ODN was blocked 
by pretreatment of the transfected cells with an ARF-3 in- 
hibitor, such as brefeldinA (Fig. 8), suggesting CpG-ODN 
induces ARF-3 and activates NF-kB after the interaction of 
CpG-ODN with TLR9. 



4 Discussion 

In this study both microarray and proteomic approaches 
were used to evaluate the eJTect of CpG-ODN on gene/pro- 
tein expression profiles of THP-1 cells at several time points. 
Comparison of the gene expression profiles showed that 
stimulation of the cells with CpG-ODN for different periods 
of time resulted in different profiles (Tables 1-4). The differ- 
ences in mRNA expression between the cells with shori and 
long stimulation could be attributable to the low reproduci- 
bility. However, to avoid experimental variations, we not only 
used the same batch of rniooarrays from the same manu- 
facturer but also applied the samples of short and long term 
stimulation at the same time. Jn this way, we found that the 
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changes in expression fold of mRNA after CpG-ODN treat- 
ment were quite reproducible as shown by their mean 
'- SEM (Table 4). A more Kkefy explanation for the difference 
in the expression level of mRNA after different periods of 
stimulation with CpG-ODN is that the transient increase or 
decrease in these mRNA by CpG-ODN plays a significant 
role in modulating biological functions. For example, we 
found that the IL-18 receptor accessory protein from THP-1 
cells was unregulated after 2h of CpG-QDN stimulation. 
The 1L-18/IL18R system is known to activate Hil-mediated 
immune responses that play a critical role in host defense 
against infection [26]. Together with JL-18/IU8 R, several 
genes for- antimicrobial defense were also increased, includ- 
ing thioredoxin, Pro-Pol-dUTPase polyproteiri and Spl40. 
After 8h of CpG-ODN stimulation, however, none of these 
genes was activated any more (Table 4); Since sustained or 
excessive production of these antimicrobial molecules might 
lead to inflarnmarion and cellular damage 127], a plausible 
explanation is that THP-1 cells fight against the invasion of 
pathogens by up-regulating antimicrobial defense-associated 
genes at an early stage of stimulation and then shut them 
down to avoid over-activation. Whether this explanation is 
true remains to be verified. 

It is noteworthy that our data also identified the up-reg- 
ulation of several anti-infbmrnatory associated genes after 
8h of CpG-ODN stimulation. These genes included FPRL3, 
1L-10 receptor, vitamin D receptor, and LxR (Table 3). FPR 
and FPRL1 have been defined as chemotnctic factors 
involved in host defense against bacterial infection and in the 
clearance of damaged cells. Additional studies have indicated 
that FRPL1 interacts with a menagerie of structurally diverse 
pjo- and anti-inflammatory ligan ds associated with diseases, 
including amyloidosis, Alzheimer's diseases, prion disease 
and HIV |28, 29]. Therefore, FPRL1 may play an important 
role in regulating and/or balancing the production of pro- 
and anti-inflammatory molecules in CpG stimulated THP-1 
ceDs. Additionally, a recent study has demonstrated that LxRs 
and their bgands act as negative regulators of macrophage 
inflammatory gene expression and inhibit the expression of 
inflammatory mediators such as inducible nitric oxide syn- 
thase, cydooxygenase and IL-6 in response fo bacterial 
infection or LPS stimulation [30). Of interest, we found that a 
transaiption factor gene connexin 59, a regulator of IL-6 
expression, was up-regulated after 2 h of CpG-ODN stimula- 
tion. It is thus likely that CpG-ODN stimulation of THP-1 
cells for 2h may induce the expression of the proinflamma- 
tory cytokine IL-6 through the up-regulation of the connexin 
59 gene, while 8h of CpG-ODN treatment may counter- 
balance the initial inflammatory response by inducing LxR to 
inhibit JL-6 production. 

Signal transduction molecules play an im port ant role in 
cellular activation. Intracellular signal transduction systems 
employing various intermolecular interactions through 
docking elements, including SH2 and SH3 domains, have 
been reported (31-33J. Here we found that THP-1 cells 
treated with CpG-ODN for 2 h up- regulated gene expression 
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of Grb2-like piotein (which contains an SH3 dornain), while 
8h of stimulation induced Nek, Ash and phospholipase C 
binding protein (NAP4 which contains an SH2 domain). It is 
thus possible that Grb2-like protein and NAP4 may play 
important roles in CpG-ODN mediated signaling pathways. 
Furthermore, recent studies have also revealed that binding 
of CpG-DNA to TLR? results in activation of "JNK [34). Since 
JNK is activated by Nek adaptor protein and Nek interacting 
kinase [35, 36], it is possible that CpG-ODN may activate JNK 
via up-regulation of NAP4. Although a recent publication 
described the gene expression profiles of a cultured mouse 
macrophage cell line after CpG-DNA stimulation [34], their 
microarray results were only conducted at one rime-point 
(6h stimulation). Moreover, they did not report the meas- 
urement of protein expression profiles in response to CpG - 
ODN stimulation. 

Comparison of the gene and protein expression profiles 
showed that there was discordance between mRNA and pro- 
tein levels (Table 8). Similar discordance between the 
expression partem of genes and proteins was also reported in 
other system using different stimuli [37-41]. The dis- 
cordance between mRNA and protein levels could be due to 
screening capability such as detection sensitivity, choice of 
cut-off point, guantitativity of microarray and 2-D gels, as 
well as time discrepancy between gene and protein expres- 
sion [39, 40, 42, 43]. Alternatively, it could also be explained 
by posMiancriptional events, such as alternative splicing or 
PTM |39, 40, 42, 43], Another possible explanation is that 
most of the spots observed in the 2-D gels are isoforms of 
some proteins. The intensity of each spots does not neces- 
sary represent total amount of a certain protein and thus 
does not correlate with its mRNA level. Our finding that 
microarray results correlated better with Western blotting 
results (e.g., ARF-3 in Fig. 2), an approach more suitable than 
2-D gels for determining the total amount rather than iso- 
forms of a given protein, seems to suggest that formation of 
isoforms should be carefully taken into consideration when 
one tries to correlate mRNA and protein expression data. 

Using a proteomics approach, we found that CpG-ODN 
treatment up-regulated the expression of many proteins 
including HSPs, metabolic enzymes, structural proteins, as 
well as macrophage capping protein, cyclophilin and protea- 
some a and 3 chain etc. HSPs are the most abundant and 
ubiquitous soluble intracellular proteins. They are up-regu- 
lated by various stressors including temperature, glucose 
deprivation, microbial infection and cancer |44]. They func- 
tion as molecular chaperones to prevent protein aggregation 
and contribute to the folding of nascent and altered proteins. 
In addition, they aje able to regulate immune responses, 
including production of inflammatory cytokines and che- 
mokines and activation or maturation of immune cells [45, 
46]. Beside HSPs, cyclophilin as well as proteasome a and p 
chain have also been reported to be involved in the immune 
response [39, 47]; proteasome p chain is consistently up- 
regulated in human neutrophils following LPS exposure (39). 
Our finding that the piotein levels of HSPs, cyclophilin, and 
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proteasome a and 0 chain were increased after CpG-ODN 
treatment suggests that these molecules might play a role in 
the immunostimulating effect of CpG-ODN. To what extent 
these proteins contribute to the immune responses of the 
cells to CpG-ODN is currently under study. Proteomk anal- 
ysis also showed that truncated forms of grp78, grp94 and 
hsc70 were induced, a phenomenon similar to calreticulin 
observed by Richards and his coworkers [48], The expression 
of full length hsc70 and grp78 were decreased while the 
levels of their truncated derivatives was increased after CpG- 
QDN treatment These-results suggest that the degradation 
of these proteins has been enhanced We also found that 
proteosome a and p chains as well as ubiquitin specific pro- 
tease 12 were increased by CpG-ODN. Whether these 
enzymes or other enzymes were responsible for the genera- 
tion of truncated hsc70 and grp78 remains to be elucidated. 

Cells rely on multiple signaling pathways to determine 
their fates of survival, proliferation or apoptosis \4% In fact, 
apoptosis plays an important role in regulating pathogen 
infection. To be able to grow and replicate in the target cells, 
pathogens may have to block apoptosis. Results from several 
laboratories have made it clear that HSP70 and HSP27 pro- 
tect cells not only from heat, but also from most apoptotic 
stimuli |4S, 50] by binding to Akt and subsequently mediat- 
ing anti-apoptotic activity through activation of Akt [51-53]. 
Since our data revealed that CpG-ODN induced the expres- 
sion of HSP90 and HSP27, it is possible that CpG-ODN 
might prevent apoptosis by up-regulation of HSPs. 

Interestingly, our microarray data also showed that CpG- 
ODN mediated the induction of a set of genes associated 
with tumor progression and cell proliferation. Among these, 
one gene, W1SP-2, was up-regulated by. CpG-ODN after both 
2 and 8h stimulation. WISP genes were first identified as 
downstream targets of the Wnt-lp-catenin signaling path- 
way They belong to the CCN family of growth factors that 
have been receiving increasing attention lately due to some 
of the family members having been reported to be involved 
in angiogenesis and rum ori genesis [54 J. It would be inter- 
esting to evaluate whether CpG-ODN plays a role in angio- 
genesis and tumorigenesis by regulating W1SP-2. In addi- 
tion, we found that some genes associated with neurode- 
generation or neuroprotection, such as FPRLl, NMDAR 
(NMDA) receptor, PKC and dishevelled 3 were up-regulated. 
To our knowledge, this is the first report to suggest an asso- 
ciation between these genes and CpG-ODN stimulation. As 
mentioned above, FPRLl plays a crucial role in proin- 
flammatory aspects of systemic amyloidsis and neurodegen- 
erative disease sxich as Alzheimer's disease and prion disease 
[28J. NMDAR, PKC and disheveled are involved in modxilal- 
ing amyloid precursor piotein metabolism, which is central 
to the pathogenesis of Alzheimer's disease [55-57]. Most 
notably, recent studies have shown that the TLR4-dependent 
pathway is involved in neurodegeneration of the central 
nervous system [58]. Whether CpG-ODN moieties of patho- 
gens play any role in neurodegenerative diseases such as 
Alzheimer's remains to be elucidated. 
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Figure 8. ARF-3 participates in the CpG-ODN-TLR9-NF-KB path- 
way. HEK293 cells were cotransferted with p5xNF-»cB and human 
TLR9: After overnight transfection, the cells were incubated.with 
or without 1.5 mm CpG-ODN for 8h in the presence or absence of 
increasing concentrations of the ARF-3 inhibitor brefeldiri A. 
After incubation, cells were rysed and NF-xB lucif erase activity 
was measured. Data represent mean ± SEM. {n = 3). 

Exposure of cells to LPS or microbial infection has been 
known to induce several genes encoding metabolic enzyme 
[34, 39). Our microarray data also revealed that a large number 
of genes encoding proteins involved in energy synthesis and 
fatty acid oxidation, such as en oyl- coenzyme A hydratase, 
propionyj coenzyme A carboxylase and cytochrome p4 50 
were activated by CpG-ODN treatment. In addition, we found 
that other proteins such as ARF-3 were up-regulated (Table 3, 
Fig. 8). ARFs are 20kDa GTPases of the ras superfamily that 
are critical to vesicular trafficking, including exocytic protein 
transport and endocytosis [59, 60). This study demonstrates 
for the first rime that ARF-3 is involved in the activation of N F- 
kB induced by CpG-ODN (as shown in Fig. 8) 

CpG-DNA/ODN has been shown to elicit primarily 
responses via the TLR9/MyD88 dependent pathway (38, 21, 
22]. Chromosome location analysis showed that instead of 
localizing on one or two chromosome, the genes/proteins 
modulated by CpG-ODN stimulation are scattered on all 
chromosomes except chromosomes 23 and 24. These results 
seem to suggest that CpG-ODN either affects multiple chro- 
mosomes simultaneously or subsequently via cascades of 
cellulaT messengers. More studies are needed to elucidate its 
mechanism of octions. 



5 Concluding remarks 

In summary, by using microarray and proteom'ic approaches 
to evaluate the effect of CpG-ODN at different time points, 
we have found that genes/proteins regulated by CpG-ODN 



are related to irjJkmrnatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transaction, 
tumor progression, cell differentiation, proteolysis, anti-apop- 
tosis as well as neurodegenera tion and neuroprotection. Our 
results may help delineate the CpG-ODN mediated pathway 
and contribute to further understanding of mechanisms that 
link innate immunity with acquired immune re*>ponse{s). 
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Institute of Medical Biochemistry, University of Oslo, Norway. 

^^^^^ZfZr^^ ^ of Che various 

cDNA probes, iso!r^^m!^l^ T teSticU,ar CeI1 ^ 

mRNA and protdn^ere eSST^J? 1 measureme ^- Amounts of 

germ cells (pachytene Zc^o^T h *** 1 ^ CUltUrfed P e ^b"lar cells, 
whole teste^fromT o^r^ as well as 

between the amount of STSSS ^ k ' ^ WW * 8 °° d COrreIation 
of ceH, such as germ ^^^S^^ * 
amounts of RII beta mRNA were founS ^ ^ was not always the case. Large 
immunoreactive prot^waslw F^^l* ™ Where3S the am0UDt of 
specific mRNAsfbr RI T.oT/T* ° f sma,, - si ^ germ cell- 

developing rat testi! LfLrfo to 4oSwf"t?the f ^ ^ ^ f0Und * * e 
only partially reflected at the protein fevel vZchl ^ ° f mRNA Were 

spermatids were practically de^o bo tmC^m^TT" 
spermatid differentiation, there was a decrease in RI a^ha ^H ^ ^ DUnng 
protein. Cell specific distribution of me va^ol P^A ^ 30 Urease in Ril alpha 
described. Jnsome^ 

demonstrated, which clearly surest cellsneHfiV^J and protem were 

for some of these mRNAs oartfc!^ m translational efficiencies 

PMID: 8107013 [PubMed - indexed for MEDLINE] 
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Quantification of CK20 gene and protein expression in colorectal cancer 
by RT-PCR and immunoliistoclieaiistry reveals inter- and intratumonr 
heterogeneity. 

LassmannS, Bauer M, Soong R . Schregtmann J . Tahiti K , NahrigJ , Ruger R 
Hofler H , Werner ML ' 

Pathologists Institut, Universitatskiinikum Freiburg, Albertstrasse 19, 79104 Freiburg, 
Germany, lassmann@ukl.uni-freiburg.de 

Cytokeratin 20 (CK20) is an epithelial protein expressed almost exclusively in the 
gastrointestinal (GI) tract and is widely used as immunohistochemical marker for routine 
diagnosis. In contrast, CK20 gene expression is not an established marker for the 
classification of tumours and the detection of disseminated cancer cells in colorectal 
cancer. Recently, real-time reverse transcriptase polymerase chain reaction (RT-PCR) has 
provided the means for reproducible and quantitative investigation of molecular markers. 
This report directly compares CK20 mRNA and protein expression in serial sections of 
archival, formalin-fixed, paraffin-embedded (FFPE) colorectal adenocarcinomas. CK20 
expression was detected by immunohistochemistry (1HC) in 60/63 (95.2%) cases by 
conventionatRT-PCR in 58/60 (96.7%) and by quantitative RT-PCR using the 
LightCycler (LightCycler is a trademark of a Member of the Roche Group) System in 
29/32 (90.6%) microdissected cases, one case yielding variable results. Despite the high 
detection rate of all three techniques, marked heterogeneity of CK20 expression was seen 
between different cases and also widiin individual cases. CK20 expression profiles were 
not related to particular histopathologic^ features of the tumours. A good correlation (r = 
0.8964) was found between COO mRNA and protein expression by comparing 
quantitative RT-PCR with IHC in 32 cases. Tins was also true for selected heterogeneous 
tumour cells within individual cases. Both RT-PCR and IHC are therefore valuable tools 
for CK20 detection in colorectal adenocarcinoma, with real-time RT-PCR providing 
supplementary quantitative information. This suggests a promising supportive role for 
quantitative RT-PCR in molecular pathology. Copyright 2002 John Wiley & Sons, Ltd. 

Publication Types; 

• Evaluation Studies 
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Expression of fcUc multidrug resistance-associated protein (MRP) mRNA 
and protein in normal peripheral blood and bone marrow haemopoietic 
cells. 

Lcgrand O, Pcrrot JY, Tang R . Simonm G . GurbuxaniS . Zittoun R , Marie Jr . 

Laboratoire de Cinetique et de Cultures Cef lulaires, Hotel Dieu, Paris, France. 

We studied the expression of multidrug resistance-associated protein (MRP) in normal 
haemopoietic cells from peripheral blood and bone marrow. The MRP mRNA levels 
were estimated by RT/PCR and in situ hybridization (ISH) assay, and the protein levels 
by flow cytometry. 21 samples of peripheral blood and 21 samples of bone marrow (1 1 
normal bone marrow donors, 10 patients in complete remission after chemotherapy for 
large cell lymphoma or acute myeloid leukaemia) were analysed. In peripheral blood the 
mean MRP mRNA level in CD3+ cells was statistically higher than in the other cells (3- 
fold by the methods used). The levels of MRP in CD3+ varied from one individual to 
another (4.5-34,8 units by RT/PCR and 5-23 grains/cell by ISH); however, this was 
proportional to the variation in all the cell lineages of same iudividual (r - 0.84). In bone 
marrow the mean MRP levels of the various cell lineages (including CD34+-) were 
similar to the basal level in HL60 cells. Individual expression levels were again variable; 
however, there was no difference between untreated normal bone marrow and post 
chemotherapy normal bone marrow. MRP protein expression was determined by flow 
cytometry with the monoclonal antibody MRPm6. The CD4+ lymphocytes exhibited a 
higher MRP protein expression than the other cell lineages, including CD8+ cells. There 
was a good correlation between the three methods used (RT/PCR and ISH, P = 0.000 1, r 
- 0.87; RT/PCR arid flow cytometry, P - 0.000 1 , r = 0.8 5; ISH and flow cytometry, P = 
0.002, r = 0.67); . 

PMID: 8757504 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor enhances cardiac allograft 
arteriosclerosis. 

Lemstrom KB , KrebsR, Nykancn AL Tikkanen JM . Sihvola RIC Aalcola EM 
Hayry_PJ, WoodJ, Ajitalo K, Yla-Herttuala S Koskiacn PK . 

Cardiopulmonary Research Group, Transplantation Laboratory, University of Helsinki 
and Helsinki University Central Hospital, Helsinki, Finland. Karl.Lemstrom@helsinki.fi 

BACKGROUND: Cardiac allograft arteriosclerosis is a complex process of alloimmune 
response, chronic inflammation, and smooth muscle cell proliferation that includes cross 
talk between cytokines and growth factors. METHODS AND RESULTS: Our results in 
rat cardiac allografts established alloimmune response as an alternative stimulus capable 
of inducing vascular endothelial growth factor (VEGF) mRNA and protein expression in 
cardiomyocytes and graft-infiltrating mononuclear inflammatory cells, which suggests 
that diese cells may function as a source of VEGF to the cells of coronary arteries Linear 
regression analysis of these allografts with different stages of arteriosclerotic lesions 
revealed a strong correlation between intragraft VEGF protein expression and the 
development of intimal thickening, whereas blockade of signaling downstream of VEGF 
receptor significantly reduced arteriosclerotic lesions. In addition, in cholesterol-fed 
rabbits, intracoronary perfusion of cardiac allografts with a clinical-grade adenoviral 
vector that encoded mouse VEGF(I64) enhanced the formation of arteriosclerotic 
lesions, possibly secondary to increased intragraft influx of macrophages and 
neovascularization in the intimal lesions. CONCLUSIONS: Our findings suggest a 
positive regulatory role between VEGF and coronary arteriosclerotic lesion formation in 
the allograft cytokine microenvironment. 
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CHAPTER 29 

Regulation of transcription 



The phenotypic differences lhat distinguish the 
various kinds of ceils in a higher eukaryote are 
largely due lo differences in the expression of 
genes thai code for proteins, that is, those tran- 
scribed by RNA polymerase II. In principle, the 
expression of these genes might be regulated at 
any one of several stages. The concept of the 
"level of control" implies that gene expression 
is not necessarily an automatic process once it 
has begun. H could be regulated in a gene- 
specific way at any. one of several sequential 
sieps- We can distinguish (al least) five poten- 
tial control points, forming the series: 

Activation of gene structure 
I 

Initiation of transcription 
I 

Processing the transcript 

i . 

Transport to cytoplasm 
I 

Translation of mRNA 

The existence of the first step is implied by 
'he discovery that genes may esisl in either of 
two structural conditions. Relative to the state 
of most of the genome, genes are found in 
3n "active" stale in the cells in. which they 
are expressed (see Chapter 27). The change of 
structure is distinct from the act of transcrip- 
tion, and indicates that the gene is "transcrib- 
able* This suggests that acquisition of the 
"active* structure must be the first step in gene 
expression. 

Tmii script ion of a gene in the ,-»( tjve Male is 



controlled at the stage of initiation, Inat is, by 
the interaction of RNA polymerase with its pro- 
moter. This is . now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
28). For most genes, tins is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example, via antilermination 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
tion at the 3' end; Introns must be spliced out 
from the transcripts of interrupted genes. The 
mature RNA musi be exported from the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at the level of nucJear 
JIN A might involve any or att of these stages, 
but the one for which wc have most evidence 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, the translation of an mRNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
adult somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
't he mechanism is presumed to involve the block- 
ing of initiation of translation of some mRNAs by 
specific protein factors. 

But having acknowledged thai control of gene 
expression am occur at multiple stages, and 
that production of UNA cannot inevitably be 
equated with production of protein, if is clear 
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848 I Chapter 29 



that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
thai causes a gene to respond to such a factor 
is called a response element; examples are the 
_HSE (heat shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found hi dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
ei lends for a short distance on either side of 

if 



■ Table 29.1 inducible transcription factors, bind to" ': 
' response elements that identify groups of promoters 
"or enhancers subject to coordinate control.. : 



Ptegulolory Agenl Module Consensus Factoi 

Heat shock USE CNNGAANN1CCNNG HSTF 

Glucocorticoid GRE 7GGTACAAXT GTTCT Receplor 

Phorbol estei IRE 1 GACT CA " API 

Serum GRE CCAlATTAGG ' SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoinl, but are usually <200 bp upstream 
of it The presence of a single element usually 
is sufficient to confer, the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pro- 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other: 
usually an HSE is found in a promoter, ^vhile a 
GRE is found in an enhancer. We assume that 
all response elements function by the same 
general principle. A gene is regulated by a 
sequence at the promoter or enfianccr that is 
recognized by a specific protein. Jhe protein 
functions as a transcription factor needed for 
}\NA polymerase to initiate. Active protein is 
available only under conditions when the gene is 
to be expressed; its absence means that the pr& 
motcr is not activated by this particular circuit. 

An example of a situation in which many 
genes are controlled by a single factor is pro- 
vided by the heat shock response. This is com- 
mon to a wide range of prokaryotes and 
eukaryoles and involves multiple controls of 
gene expression: an increase in temperature 
turns off transcription of some genes, turns on 
transcription of the heal shock genes, and 
causes changes in the translation of mttNAs. 
The control of ihe heal shock genes illustrates 
the differences between prokaryotic and 
euk aryotic modes of control. In bacteria, a new 
sigma factor is synthesized thai directs }W 
polymerase nonenzyme to recognize an allcr- 



PAGE 4/i * RCVD A 7 W?y?DDb 17:35:38 F'M |P;« »fjc Daylight Vwrn-J ' SVR:SVCS0l/0 • DN)D:Sf.06 ■ CSID:&13 9i? 9303 ' DURATION (rnm ss):0? ?8 



U6 b&SS±SiI isSci - 2003 Mar;44(3):I299-304. Related Articles. Unks 



Retinal preconditioning a„d the induction of heat-shock protein 27. 
UY, RothS, LascrM, MaJX, Crossori CE . 

^S^^u^ 010 ^ Medica ' U " iversity ° f South Caro,ina > Charlest ^ Soulh 

PURPOSE: Brief periods of ischemia have been shown to protect the retina from 
potently damaging periods of i^i^ Tl.is phcnon^S^^^ 
precond.honmg or ischemic tolerance. In the present study the cellular chTnge s in level 
of heat shock protein (Hsp)27, -70. and -90 mRNA and expression of Hsp , rf Z^ltrcL 
« ed Wlth , sch jc precondition[ng wcre evaluated ^ ETH0 ^ n * n a ^ a ^ a 

henrftf" 3 , ?J ^ ng " EVanS and S P ra g-Dawley ra ts for 5 minutes ^ Ss were 
then left for 1 hour to 7 days, to allow the retina to reperfuse. Retinas were dissec" ed L 
mRNA and protein isolated, and Northern and Western blot m^^^M^A^i 
Ranges tn expression of Hs P 27, -70, and -90. ImmunohistochemiS sTu" e ^ 1 
identify retinal regions where Hsp changes occurred. Selected animals were sub ected to 
a second ischemic event, 60 minutes in duration, to correlate the chZl n exp e of 
Hsp with functional protection of the retina from ischemic injury SuLT In co^X. 
^d sham-treated animals retinal Hs P 27, -70, and -90 mRNAs iSSable FiT 

Tolu P T ndlt ' 0ning ' levds 0f Hs P 27 "RNA were elevated above control 
levels, and 24 hours later, mRNA levels increased 200% over basal levels Hsp27 

l^ZlZnT" TT d f ° r UP t0 72 h ° UrS and the " ^ to -turn totntL 
St.' P , P v " leVdS ^ mCreaSed by 200 ° /o over basal 'evels 24 hours after 

r C °f l ° mng ' remained 3t thiS ,evd for 72 hours > ■*> «™ return^ o con rol 
levels. In contrast, no consistent change in Hs P 70 or -90 mRNA or protein Svels was 
observed during the conn* of the study. Immunohistochemical s Ji^ZtZ that 
he increase m expression of Hs P 27 was localized to neuronal and non-neZnllSt 
SnSS, ' ^t^^ E,eCtroreti ^a P hy studies demonstrated a sTn g 

correlate be^een the protection of retinal function from ischemic injury and the 

SES£?£? 7f T CLUSI0NS : 111656 feSUltS Pf0Vide - vid^ te 1 ^ 
of Hs P 27 is a gene-specific event associated with ischemic.preconditionihg in the retina 
This increase ,n expression of Hs P 27 occurs in both neuronal and non-Sonal retinaf' 
ceHs, and appears to be one component of the neuroprotective J^^oT^ 
ischemic preconditioning in the retina. ' muutea oy 

PMID: 12601062 [PubMed - indexed for MEDLINE] 
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Enhanced expressions of arachidonic acid-sensitive taudetu-pore domain 
potassium channels in rat experimental acute cerebral ischemia. 

LLZB, Z^hjuyjJBX, LLLL, WangJCL. 

!f ated o QlineSe Academy of Medical Scienc <* and Peking Union 

Medical College, Beijing 100050, China. 

To further explore the pathophysiological significance of arachidonic acid-sensitive 
potass.™ channels, RT-PCR and Western blot analysis were used to investigate Z 
ZZZ^S ° f TRE £ Ch T dS in C0TteK ™ d ^-"P- * expe'rimel. 

afteTSI K T K 1 mRNA m ,u PP ocam P us sh ^ significant increases 2 h 

7*"? ^ ° CduS,0n (MCA °)- » the mRNA expression levels of 
the all three channel subtypes increased significandy 24 h after MCAO in cortex and 

^^% M 8311,6 ^ thG PK>te, ' n ex P ressi <™ of all the three channel proteins 
TWKlZ*T InCre T 24 h ^ MCA ° ia ° 0rt " tod ^campus, but oSy 
f™. 11 ^ ""T^ ex P res ^on 2 h after MCAO in cortex and hippocampus 
^unohts toctamcrf experiments Verified that all the three channel proLhs hL higher 
expression levels ,n cortical and hippocampal neurons 24 h after MCAO These resuhT 
5^.^ b «"™ TREK channels and acute cereb^cTemTa 
TREK channels m,ght provide a neuroprotective mechanism in the pathological process. 

PMID; 1 5652517 [PubMed - indexed for MEDLINE] 
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Increasing expression of tissue plasminogen activator and plasminogen 
activator inhibitor type 2 in dog gingival tissues with progressive 
inflammation. 

Lindberg P, Khmby B, Lecander I , Lgng_NP, Matsson L . 

Center for Oral Health Sciences, Malmo University, S-2I4 21 Malmo, Sweden. 
pia.luidberg@od.mah.se 

Urokinase and tissue-type plasminogen activators (u-P A and t-PA) are serine proteases 
that convert plasminogen into plasmin, which degrades matrix proteins and activates 
meta loproteinases. The P As are balanced by specific inhibitors (PAI-- 1 and P AI--21 
Local production of t-PA and PAI-2 was recently demonstrated in human gingival 
tissues. The aim now was to investigate the production and localization of t-PA and PAI- 
-2 in gingival ussues.from dogs in three welWefined periodontal conditions; clinically 
healthy gingiva, chrome gingivitis and an initial stage of ligature-induced loss of 
attachment. At the start of the experiment the gingiva showed clear signs of 
inflammation. Clinically healthy gingiva were obtained after 2 1 days period of intense 
oral hygiene^Attacliment loss was induced by placing rubber ligatures around the neck of 
some teeth. Biopsies were taken from areas representing the different conditions and 
prepared for m situ hybridization and immunohistochemistry. In clinically healthy 
gingiva both t-PA mRNA and antigen were expressed in a thin outer layer of the sulcular 
and junctional epitheha. No t-PA signals or staining were seen in connective tissue. Both 
mRNA signaling and immunostaining for t-PA were stronger in chronic gingivitis In 
areas with loss of attachment, t~PA mRNA as well as antigen were found inthe sulcular 
and junctional epithelia to a similar degree as in gingivitis. Occasionally the connective 
issue was involved, especially in connection with vessels. PAI-2 mRNA was seen in a 
thin outer layer of the sulcular and junctional epithelia in clinically healthy gingiva but 
no signals were seen in connective tissue. PAI-2 antigen was found primarily in the 
outer layer of the sulcular and junctional epithelia. Some cells in die connective tissue 
were stained. In gingivitis, PAI-2 signals were mainly found in the same locations but 
more intense and extending towards the connective tissue. Immunostaining was seen in 
the outer half of the sulcular and junctional epithelia as well as in the upper part of the 
connective tissue, close to the sulcular epithelium. In sites with loss of attachment PAI- 
2 mRNA was found throughout the sulcular and junctional epithelia, as was the antigen, 
fcSlSS? J ntCnSel >' u No PAr ~ 2 mRNA was seen in connective tissue; the antigen was 
found scattered, .especially near vessels. This study shows that the expression of both t- 
r A and PAI-2 increases with experimental gingival inflammation in the dog and 
rurthermore, the two techniques demonstrate a strong correlation between the 
topographical distribution of the site of protein synthesis and the tissue location of the 
^.tigens for both t-PA and PAI-2. The distribution correlates well with previous 
nntfings in humans. 



135: Mod Pathol. 2002 Scp;l5(9):979-87. 



Related Articles,. Links 



Effect of duration of fixation on quantitative reverse transcription 
polymerase chain reaction analyses. 

Macabco-Onp r ivr, Gum ngcr PC, DekkerN. McMillan A . Regezi JA Wone DT 
Jordan RC . ' ' 

Oral Pathology, Department of Stomatology, University of California San Francisco 
California 94143-0424, USA. 

Increasingly, th ere j s the need to analyze gene expression in tumor tissues and correlate 
these findings with clinical outcome. Because there are few. tissue banks containing 
enough frozen material suitable for large-scale genetic analyses, methods to isolate and 
quantify messenger RNA (mRNA) from formalin-fixed, paraffin-embedded tissue 
sections are needed. Recovery of RNA from routinely processed biopsies and 
quantification by the polymerase chain reaction (PCR) has been reported; however the 
effects of formalin fixation have not been well studied. We used a proteinase K-sal't 
precipitation RNA isolation protocol followed by TaqMan quantitative PCR to compare 
the effect of formalin fixation for 24, 48, and 72 hours and for 1 week in normal (2) oral 
epithelial dysplasia (3), and oral squamous cell carcinoma (4) specimens yielding 9 fresh 
and 36 formalin-fixed samples. We also compared mRNA and protein expression levels 
using immunohistochemistry for epidermal growth factor receptor (EGFR), matrix 
metalloproteinase (MMP)-l, p 21, and vascular endothelial growth factor (VEGF) in 15 
randomly selected and routinely processed oral carcinomas. We were able to extract 
RNA suitable for quantitative reverse transcription (RT) from all fresh (9/9) and 
formahn-fixed (36/36) specimens fixed for differing lengths of time and from all (15/15) 
randomly selected oral squamous cell carcinoma. We found that prolonged formalin 
fixation (>48 h) had a detrimental effect on quantitative RT polymerase chain reaction 
results that was most marked for MMP-1 and VEGF but less evident for p21 and EGFR 
Comparisons of quantitative RT polymerase chain reaction and immunohistochemistry ' 
showed that for all markers, except p21, there was good correlation between mRNA and 
protein levels. p2I mRNA was overexpressed in only one case, but protein levels were 
elevated in all but one tumor, consistent with the established translation^ regulation of 
p21 These results show that RNA can be reliably isolated from formalin-fixed, paraffin- 
embedded tissue sections and can produce reliable quantitative RT-PCR data However 
results for some markers are adversely affected by prolonged formalin fixation times. 

PMID: 12218216 [PubMed - indexed for MEDLINE] 
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W-l and Id-2 arc ovcrcxprcsscd iu pancreatic caucer and iu dysplastic 
lesions in chronic pancreatitis. 

Maruyama H , KlceffJ, WiidiS , Fricss H Bucl.lcr MW Israel MA . Korc M . 
Division of Endocrinology, Department of Medicine, University of California, Irvine, 

UoA, 

Id proteins antagonize basic helix-loop-helix proteins, inhibit differentiation, and enhance 
cell proliferation. In this study we compared the expression of Id- 1 , Id-2, and Id-3 in the 
normal pancreas, in pancreatic cancer, and in chronic pancreatitis (CP). Northern blot 
analysis demonstrated that all three Id mRNA species were expressed at high levels in 
pancreatic cancer samples by comparison with normal or CP samples. Pancreatic cancer 

W? 68 fi ! equentIy «>«q»essecl all three Ids, exhibiting a good correlation between Id 
mRNA and protem levels, as determined by immunoblotting with highly specific anti-Id 
7 f 7?- Immunoh >stochemi S tTy using these antibodies demonstrated the presence of 
taint id-I and Id-2 immunostaining in pancreatic ductal cells in the normal pancreas 
whereas Id-3 immunoreactivity ranged from weak to strong. In the cancer tissues, many 
of the cancer cells exhibited abundant Id-1, Id-2, and Id-3 immunoreactivity. Scoring on 
he basis ofpercentage of positive cells and intensity of immunostaining indicated that 
ld-1 and Id-2 were increased significantly in the cancer cells by comparison with the 
respective controls. Mild to moderate Id immunoreactivity was also seen in the ductal 
cells in the CP-hke areas adjacent to these cells and in the ductal cells of small and 
interlobular ducts in CP. In contrast, in dysplastic and atypical papillary ducts in CP Id-1 
and Id-2 immunoreactivity was as significantly elevated a<s in the cancer cells These 
findings suggest that increased Id expression may be associated with enhanced 
proliferative potential of pancreatic cancer cells and of proliferating or dysplastic ductal 
cells in CP. 
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ld-1 and ld-2 Are Overexpressed in Pancreatic 
Cancer and in Dysplastic Lesions in 
Chronic Pancreatitis 



Haruhisa Maruyama/ Jorg Kleeff,* Stefan WiloV . 
Helmut Friess,* Markus W. Buchler* 
Mark A. Israel * and Murray Korc* 

From the Division of Endocrinology, Diabetes* and Metabolism*. 
Departments of Medicine, Biological Chemistry and 
Pharmacology, University of California, Irvine, California; the ' 
Department of Visceral and Transplantation Surgery? University 
of Bern, Bern, Switzerland; and the Preuss Laboratory* 
Department of Neurological Surgery, University of California, 
San Francisco, California . 



Id proteins antagonize basic belix-Joop-helix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
. eration. In this study we compared the expression of 
Id-], Jd-2, and Id-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species were expressed at high levels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific anti-Id 
antibodies. Immunohistochemistry using these anti- 
bodies demonstrated the presence of faint Id- J and 
ld-2 immunostaining in pancreatic ductal cells in the 
normal pancreas, whereas ld-3 immujioreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant ld-1, 
ld-2, and ld-3 immunoreaclivity. Scoring on the basis 
of percentage of positive ceils and intensity of immu- 
nostaining indicated that ld-3 and ld-2 were increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id immuno- 
reaclivity was also seen in the ductal cells in the 
CP-like areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
ld-1 and Id- 2 immunoreactivity was as significantly 
elevated as in the cancer cells. These findings suggest 
that in creased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (Am J Pathol J9<J% 155:815-%22) 



Basic hetix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation; 1,2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1,2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes.*; 5 These tissue- 
specific bHLHs typically form heterodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A, £2-2, HEB, and other genes (also termed 
E-proteins). 6 "' 9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that lack 
the basic DNA binding domain. 1 - 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 1012 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo. 13-18 In addition, Id proteins seem 
to be required for cell cycle progression through G^/S 
phase in certain cell types, and interaclion between ld-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro? 9 ' 23 

Pancreatic cancer is the fifth leading cause of cancer 
death in. the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the over expression of a variety of myo- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation ol the p53 tumor 
suppressor gene. 25 We have recently reported that pan- 
creatic cancers over ex press the HLH protein ld-2, and 
that enhanced expression of this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory, disease that is characterized by 
dysplastic ducts, foci of proliferating ductal cells, acinar 
cell degeneration, and fibrosis.^ We now report that 
there is a five- to sixfold increase in ld-1 and Id-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-68 years), CP tissues from 13 males and 1 female 
(median age 42.1 years; range 30-56 years), and parv 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years; range 63-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
duodenopancreatectomy (Whipple/pylorus-preserving 
Whipple; n = 13), a left resection of the pancreas {n = 2), 
or a total pancreatectomy (n = 1) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
* (UICC) 6 tumors were stage 1 , 1 was stage 2, and 9 were 
stage 3 ductal cell* adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. Atl studies were ap- 
proved by the Ethics Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine. California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
: viously. 26 - 28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1 .2% agarose/1 .8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and. hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979-bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A , 
BamH\ 190-bp fragment of mouse 7S cDNA that, hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS trims and the resulting 
autoradiographs were scanned to quantify the intensity of 
the radiographic bands. 26 28 For each sample the ratio of 
!d mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used lor normal/ 




Normal Cancer 



• fit? If id-i 





Figure 1. mRNA expression of Jd-1, ld-2, 2nd ld-3. in pancreatic cancer and 
chronic pancreatitis. Total KNA (20 jig/lane) from six normal, eighl cancer- 
ous, and seven chronic pancreatitis tissue samples we're, subjected to North- 
em blot analysis using ^P-labeled cDNA probes (500,000 cphi/ml) specific 
for Jd-1, ld-2, and ld-3 r respectively-' A 7S cDNA probe (50,000 cpm/mt) was 
used as a loading and transfer control. Exposure limes of the norma l/cancer 
bJots were 1 day for all id piobes, and 2 days for the normal/CP blots. 
Exposure time was 4 hours for mouse 7S cDNA. By comparison with the 
normal samples, ld-1 and ld-3 mRNA levels were elevated in 8 and 9 cancer 
samples, respectively, whereas ld-2 was elevated in 6 cancer samples. 

cancer and normal/CP membranes. The median score for 
ld-1, ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific. San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of signilicance. 

Cell Culture and Western Blot Analysis 

PANC-1, MJA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell lines were obtained from ATCC (Manas- 
sas, VA). COLO- 357 human pancreatic cells were a gilt 
from Dr. R. S. Melzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MlA-PaCa-2. PANC-1) or 
RPMI (ASPC-1. CAPAN-1) supplemented with 10% fetal 
bovine serum, 100 U/ml penicillin, and 100 Mg/ mt strep- 
tomycin. For imrnunoblot analysis, exponentially growing 
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Figure 2. Densitometric analysis of Northern Mots. A uto» digraphs of 
Northern blots from 12 normal. J 4 CP r and J 6 pancreatic ameers were 
analyzed by densitometry. mRNA levels were determined by calculating ihe 
mto of uV optical density lor the respeciive Id mRNA species in relation to 
. the optical density of mouse 7S cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in uV cancer and CP samples, 
the same normal samples were used for normal/cancer and normal/CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues by 
triangles, and cancer tissues by squares. Data are expressed as median 
scores ± SIX By comparison wiih rbe normal samples, only the oncer 
samples exhibited significant increases: 6.}-fold (P < 0.0)) for Id-), fivefold 
(P < 0.01.) for Id- 2. and twofold KP = 0.027) for Id- 3- '* 



cells (60-70% confluent) were solubilized in lysis buller 
containing 50 mmoi/L Tris-HQ, pH 7.4, 150 mmol/L NaCI, 
1 mmol/L EDTA, 1 ^g/ml pepstatin A, 1 mmol/L phenyls 
. met hylsultony I fluoride (PMSF), and 1% Irilon X-100. Pro- 
leins were subjected to sodium dodecyl sulfate polyacryl- 
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Figure 3. Id mRNA and protein expression in pancreatic cancer cell lines. 
Upper panels:. Total RNA (20 j*g/lane) from 5 pancreatic cancer cell lines 
were subjected to. Northern blot analysis using 3 ?P-label«rd cONA probes 
(500,000 cpm/ml) specific for ld-1, )d-2, and Id-3 : respectively. Exposure 
times were ) day for all Id probes. Lower panels: Jmmunoblotting. Cell 
lysares (30 ng/lane) were subjected to SDS-PACE. Membranes were probed 
with specific Jd-1, Jd-2, and Id-3 antibodies. VisuahMiion was performed by 
enhanced chemilurwinescence. 



amide gel electrophoresis (SDS-PAGE), transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemifuminescence. 



immunohistochemistry 

Specific rabbit anti-human ld-1 (C-20), ld-2 (C-20), and 
Id- 3 (C-20; all from Santa Cruz Biotechnology, Santa 
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Figure 4. Normal and cancerous pancreatic (issues "ere subjected to immu- 
nosiaintng using highly specific aniMd] (A-C), ami-Id- 2 (D-F). and ami- ld-3 
(G-D antibodies as described in the Methods section. Moderate to strong )d-l 
immunoreactjvity was present in the cytoplasm of duct-like cancer ceils (A 
and C, left panel). In the normal pancreas there was weak ld-1 immunore-. 
activity in the ductal cells (B). Pieabsorption with the Id-l-specifk blocking 
peptide abolished the ld-1 immunoreactiviry (C, righl paneD. Strong )d-2 
immunoreactiviry was observed in the cytoplasm of the cancer cells thai 
exhibited duel-like structures CD and F. left panel), whereas in the normal 
pancreas, there was "only weak ld-2 immunoi cacti vity in the ductal cells (E). 
Preabsorption wiih ihe Id- 2- specific blocking peptide abolished the ld-2 
immunoreactivity (F. right paneQ. Moderate io strong kl-3 iimnuiwreaciiviiy 
was present in the due i- like cancer cells (G and I, left panel). Moderate to 
strong ld-3 immunoreactivity was also present in the ductal cells ol normal 
pancreatic tissue samples (H). ld-3 immunoreactivity was completely abol- 
ished by pieabsorplion with the )d-3 specific blocking peptide fi. righl 
panel)- A, D, and G constitute serial sections of a pancreatic cancer sample, 
revealing compression ol the three Jd proteins. Scale bars. 25 prn. 

Cruz, CA) polyclonal antibodies were used loi immunhis- 
tochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with ld-1 , Id 2, and ld-3, respec- 
tively, of human origin, as determined by Western 
blotting. Paraffin-embedded sections (4 pm) were sub- 
jected to immunostaining using the streplavidin-peroxi- 
dase technique. Where indicated, immunostaining for all 
three Id proteins was performed on serial sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes (23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex,- 
using diamindbenzidine tetrahydrochloride as the sub- 
strate. Sections were counterstained with Mayer's hema- 
toxylin. Preabsorption with ld-1- f ld-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were semiquantitatively analyzed as de- 
scribed previously. 29 ; 30 The percentage of positive 
cancer cells was stratified into four groups: 6, no cancer 
ce(ls exhibiting immunoreactivity; 1, <33% of the cancer 
cells exhibiting immunoreactivity; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal was also stratified into 
four groups: 0, no immunoreactivity; 1, weak immunore- 
activity; 2, moderate immunoreactivity; 3. strong immu- 
noreactivity. Finally, the sum of the results of the cell, 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 1 2 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1.2-kb ld-1 transcript and the 
1.6-kb Id2 mRNA transcript in 11 of the 12 normal pan- 
creatic samples, and the 1.3-kb ld-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A. 2). In the can- 
cer tissues, Id-i mRNA levels were elevated in 8 of 16 
samples, ld-2 mRNA levels were elevated in 9 of these 
samples, and ld-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A, 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometric analysis of 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in ld-1 mRNA levels, a fivefold 
increase (P < 0.01) in ld-2 mRNA levels, and a twofold 
increase (P = 0.027) in ld-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls . 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in Ihe expression levels of ld-1, ld-2, and 
ld-3, in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer celt lines by North- 
ern and Western blot analyses, ld-1 mRNA was present 
at varying levels in all 5 cell lines (Figure 3). ASPC-1, 
CAPAN-1. MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels of Id- 1 mRNA, whereas COLO- 357 cells 
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expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-ld-1 antibody 
confirmed the presence of the approximately 14-kd ld-1 
protein in the 4 cell lines that expressed high levels of 
Id-1 mRNA (Figure 3). Furthermore, the three ceil lines 
with the highest Id-1 mRNA expression (CAPAN-1, MIA- 
PaCa-2, and PANC-1) also exhibited the highest ld-1 
protein expression. Variable levels of the 1.6-kb id-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
C0LO357 arid MIA~PaCa-2 cells. Immunoblot analysis 
with a highly specific ant i- ld-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exception of PANC-1 
cells, tn which the 16-kd band was relatively faint (Figure 
3). With the exception of MIA-PaCa-2 cells, there was a 
good correlation between ld-2 mRNA and protein levels 
(Figure 3). Jd-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells, ld-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3). 
Immunoblot analysis with a highly specific ami- ld-3 anti- 
body revealed an approximately 14-kd band that was most 
abundant in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a taint 
ld-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with the exception ol PANC-1 cells, there was a good cor- 
relation between ld-3 mRNA and protein. levels. 



To determine the localization of ld-1, ld-2, and ld-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong ld-1 immunoreactivity was present in 
the cancer cells in 9 of 10 Randomly selected cancer 
samples. An example of moderate ld-1 immunoreactivity 
is shown in Figure 4A, and of strong immunoreactivity in 
Figure 4C (left panel). In contrast, in the normal pancreas, 
faint ld : 1 immunoreactivity was present only in the ductal 
cefls of pancreatic ducts (Figure 4 B, arrowheads). Pre- 
absorption with the ld-1 -specific blocking peptide com- 
pletely abolished the ld-1 immunoreactivity (Figure 4C, 
right panel). The cancer cells also exhibited strong ld-2 
(Figure 4, D and F, tell panel) and moderate to strong fd-3 
immunoreactivity. An example ol moderate ld-3 immuno- 
reactivity is shown in Figure 4G, and 'of strong immuno- 
reactivity in Figure 4 1 (left panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas ld-3 immuno- 
reactivity in these cells was more variable and ranged 
from moderate to occasionally strong (Figure 4FI). Islet 
cells and acinar cells were always devoid of Id immunore- 
activity. Preabsorption ol the respective antibody with the 
blocking peptides specific lor ld-2 (Figure 4F, right panel) 
and ld-3 (Figure 41, right panel) completely abolished im- 
munoreactiviiy. Analysis of serial pancreatic cancer sec- 
tions revealed that there was often colocalizaiion ol the 
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Figure 6. immimuhistochemistry of atypical papillary epiihclium in CP tissues. Serial scciion analysis of some CP samples repealed ihe presence of large duci-like 
Structures with atypical pupillary epithelium. Mild to moderate ld-1 (A) and Id- 2 (B) imrmtnoreactivity and weak ld-3 (C) immunoreaclivity was present in the 
cytoplasm of the cells forming these large ducts wiih papillary stmciures. Some CP samples also exhibited moderate ld-3 immunoreaclivity in these cells (D). Scale 
bar, 25 nm. 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and G. 

ld-1, ld-2, and ld-3 immunoreactivity was also present 
at moderate levels in the cytoplasm of ductal cells within 
CP-like areas adjacent to the cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. In 4 of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular duels, all ol which exhibited mod- 
erate to strong ld-1, ld-2. and ld-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis ol one of those CP samples revealed 
mild to moderate ld-1 and ld-2 immunoreactivity and 
weak ld-3 immunoreactivity in the cells of these atypical 
papillary ducts (Figure 6. A-C). In contrast, in some ol 
these CP samples, moderate to strong ld-3 immunoreac- 
tivity was also observed (Figure 60). However, most of 
the ductal cells forming the typical ductular structures ol 
CP, such as large interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id immunoreactivity (data no! shown). 



The immunohistochemical data for ld-1, ld-2, and ld-3 
are summarized in Table 1. In the case of ld-1 and ld-2, 
the cancer cells as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast, due to the marked variability in ld-3 immuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family of HLFI transcription (actors 
that are important regulators ol cellular differentiation and 
proliferation. 12 To date, four members of* Ihe human Id 
family have been identified. 1TO_i;? "Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic stimuli, 1931 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 1t> • ^? " ^ ' , The forced expression 
of ld-1 in mouse small intestinal epithelium results in 
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Table 1. Histological Scoring 









ld-1 


ld-2 


ld-3 


Normal (n = 
Cancer {n = 
CP (n = 9) 


6) 
10) 


Ductal cells 
Cancer cells 

Typical CP lesions (n = 9) 
Dysplastic ducts {n = 4) 
Atypical papillary ducts (n = 5). 


2.0 ± 0.4 
4.5* ± 0.5 
. 2.7 ± 0.5 
5.3* ± 0.2 
4.4* ± 0.2 


2.3 ± 0.2 
5.2 s ±0.3 

3.1 + 0.6 
5.8* ± 0.2 
5.2* + 0.2 


2.5 + 0.9 
4.5 + 0.6 
3.4 + 0.7 
5.3 + 0.4 
5.0 + 0.4 



Scoring of Ihe histological specimens was periormeo as aescrroeo in me rauenis anu iv»eu kjuc m^uum. <^ »^ » 

of samples Mealed in parenthesis. P values are based on comparisons with the respective controls in the normal samples. 
\ P < 0.02; + P .< 0.01; *P = 0.004; 5 P » 0.001. 



adenoma lormation in these animals. 35 The growth-pro- 
moting ellects of Id genes are thought to occur through; 
several mechanisms. For example; ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20,21 and ld-1 and ld-2 
can antagonize the bHLH-rhediated activation of known 
inhibitors of cell cycle progression such as the cyclin- 
dependent kinase inhibitor p2i. 23 

in the present study, we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2. and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence. of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and ld-2 immunoreactivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1, ld-2, 
and ld^3 immunoreactivity. In the cancer samples, all 
three Id proteins often colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of. ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the hall-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 ol 5 
cell lines. In view ol the fact that two possible initiation 
codons have been reported lor the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosis 
pathways. Thus, these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 38 and 
abnormal resistance to Fas-ligand-medtated apopto- 
sis. 39 It has been shown recently that forced constitutive 
expression o! Id genes together wiih the expression of 
anti-apoptotic genes such as BcU2 or BclX L can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells in vivo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic, cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
reslricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, there was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples. Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
* general population. 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- . 
creatic cancer, such as the adenoma-carcinoma se- 
quence ol colorectal carcinoma, 41 it is generally ac- 
cepted that K-ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process 37 - 41 ~ 43 increased Id ex- 
pression may contribute to malignant transformation ol 
cultured cell lines in vitro 11 and has been linked to cell 
invasion in a murine mammary epithelial cell line. 44 In 
view of the current findings that ld-1, ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic duels in CP. these observations raise the 
possibility that elevated levels of ld-1, ld-2. and, to a 
lesser extent, ld-3 may represent relatively early markers 
of pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancreatic cancer. 
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We have previously reported the construction of a constitutively active luteinizing 
hormone receptor by covalently linking a fused heterodimeric hormone to the 

ZZ^mZ f th a ? prot r coup,ed receptor - ™ s y° ked ho-o n t eceptor 

complex (YHR) was found to produce high levels of cAMP in the absence of exogenous 
hormone. Stable hues expressing YHR were generated in HEK 293 ceUs "o obi fET 

it was found that YHR expression was greatly reduced. Herein we describe the 

^iTrV 0 ? 101131 'r ° f HEK 293 <* US W Which * e 5££ is under 

hght control of YHR expression both by dose and time of incubation with doxvcvcline 
These experiments demonstrated a good correlation between c^p^^SSSr 

Sltl r AMP Pr ° d r Cti0n - MQre0Ver ' ** reductio « * -Pr^ on 

followmg doxycychne removal revealed that YHR mRNA and protein decayed at similar 
mtes, again suggesting a strong linkage between mRNA and protein levTrhe 

tntZ I g P P aSSOC,ated Wl(h institutive receptor activation and may prove 
to be advantageous in developmental studies with transgenic animals. 
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c-fos and estrogen receptor gene expression pattern in the rat uterine 
epithelium during the estrous cycle. 

Me n doza-Rodr.>,, CT C.A Mcrchant-Larins IT Segnra-Valdez Ml . Mnr^nn- 
MendozaN, CruzME, Arteaga -Lopez P, Camacho-Arrovo i Dominlu^R. Cerhnn 

Facultad de Quimica, Universidad Nacional Autononia de Mexico, Ciudad Universitaria 
Coyoacan 04510, Mexico, D.F., Mexico. ' 

Different studies in oyariectomized estrogen treated animals support the idea that c-fos 
plays a role in the proliferation of uterine epithelial cells. However, these studies invite us 
to reassess the role played by c-fos in epithelial cell types of the endometrium during the 
estrous cycle. The present study was undertaken to determine the c-fos and estrogen 
receptor (ER) gene expression pattern in the rat uterine epithelium during the estrous 
cycle in which natural and cyclic changes of steroid hormones occur, and correlate these 
changes with the proliferation status of this cellular types. Proliferation was assessed 
during the estrous cycle using bromodeoxyuridine incorporation to DNA. ERalpha and 
beta proteins were assessed by immunohistochemistry. The regulation of c-fos gene 
expression in the uterus of intact animals during the estrous cycle was evaluated using 
both in situ hybridization and immunohistochemistry. Estradiol (E(2)) and progesterone 
(P(4j) plasma levels were assessed by radioimmunoassay. The results indicated that 
luminal (LE> and glandular epithelia (GE) presented maximal proliferation during the 
metestrus (M) arid the diestrus (D) days. However, during the proestrus (P) day only LE 
presented proliferation, and during the estrus(E) day only the stromal cells proliferated, 
A marked immunostaining for ERalpha was detected in both LE and GE cells during the 
early phases of the cycle but diminished on the P and the E day. In contrast, ERbeta was 
undetectable in both epithelia during all stages of the cycle. The highest c-fos mRNA 
level was detected in both epithelia on the M day, followed by a significant reduction 
during the other days of the cycle. The highest protein content was observed on the M 
and D days, and the. minimal value was detected on the E day. The c-Fos protein level in 
LE was increased during M and D days, presenting a high correlation with the cellular 
proliferation pattern of this cell type. In conclusion, the overall results indicate that c-Fos 
protem presented a good correlation with uterine epithelial cell proliferation of LE In the 
case ot GE the same tendency was observed, although no significant correlation was 
found. Both in LE and GE, c-fos mRNA did not strictly correlate with its protein levels 
C-tos seems to have a postranscriptional regulation in uterine epithelial cells during the 
rats estrous cycle. .Copyright 2003 Wiley-Liss, Inc. 
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(3HJMK-801 binding and the inRNA for the NMDAJEU subuait of the 
NMDA receptor arc differentially distributed in human and rat forebrain. 

Mconi P, Mugnaini M . Bunncmanti BH . Trist DG . Bowery NG . 

Department of Pharmacology, Medical School, University of Birmingham, UK. 
meohip@novell5.bham.ac.uk 

The distributions of [3H]MK-80T binding and the NMDA NR1 subunit mRNA were 
studied using receptor autoradiography and in-situ hybridization in rat and human brain 
whole-hemisphere coronal sections. Receptor protein detected by radioligand 
autoradiography and the mRN A for the key subunit of the receptor presented similar 
distributions in the forebrain, with a few areas showing an imbalance between the levels 
of mRNA and receptor protein. Human frontal cortex showed a relative abundance of 
NMDAR1 mRNA as compared to [3HJMK-801 binding. The same area in rat brain did 
not show any difference in the two distributions. In comparison, the rat claustrum 
presented a relative excess of NMDAJU mRNA which was not detected in human 
sections. Human caudate nucleus exhibited relatively high levels of [3H]MK-80l binding 
that were unmatched in rat caudate. The hippocampi of either species presented similar 
levels of [3HJMK-801 binding and NMDAR1 mRNA, but when the two signals were 
measured m specific subfields of the hippocampal formation/the differential distribution 
of the two signals reflected the anatomy of hipppcampal connections assuming a 
preferential dendritic distribution for MK-801 binding. Interestingly, rat and human 
hippocampi also showed some important species-dependent differencein the relative 
distribution of the receptor protein and mRNA. The data presented show an overall good 
correlation between the mRNA for the key subunit of the NMDA receptor and the 
functional receptor detected with radioligand binding and highlight the presence of local 
differences m their ratio. This may reflect different splicing of the mRNA for the 
NMDAR I subunit in specific brain areas of rat and human. The species-dependent 
differences in the relative distribution of the mRNA for the key subunit of the NMDA 
receptor and that of a marker of functional receptors also highlights important differences 
in the NMDA function in rat and human brain. 
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Abstract 

Transition initiation is regulated in response to 
nutrient availability and mitogenic stimulation and is 
coupled with cell cycle progression and cell growth. 
Several alterations in translationai control occur in 
cancer. Variant mRNA sequences can alter the 
transnational efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in 
the expression or availability of components of the 
translationai machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translationai 
activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translationai control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer is to exploit, the differences in gene expression between 
cancer cells and norma) cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translationai regulation has been best recog- 
nized among developmental biologists, because transcription . 
does not occur in early embryogenesis in eukaryotes. For ex- 
■ ample, in Xenopus, the period of transcriptional quiescence 
continues unt3 the embryo reaches midbJastula transition, the 
4000-cel) stage. Therefore, all necessary mRNA molecules are 
transcribed during oogenesis and stockpiled in a transtetionally 
inactive, masked form. The mRNA are transtetionally activated 
at appropriate limes during oocyte maturation, fertilization, and 



early embryogenesis and thus, are under strict translationai 
control 

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a . conse- 
quence of mitogenesis. Until recently, however,, little was 
known about the alterations In mRNA translation in cancer, 
and much is yet to be cSscovered about their role in the 
development and progression of cancer. Here we review the 
basic principles of translationai control, the alterations en- 
countered h cancer, arid selected therapies targeting transla- . 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translationai Control 
Mechanism of Translation Initiation 
Translation initiation is the main step hd translationai regulation. 
Translation initiation is a complex process in which the initiator 
tRNA and the 4 OS and 60S ribosomal subunits are recruited to 
the 5' end of a mRNA molecule and assembled by eukaryotic 
translation initiation factors into an SOS ribosome at the start 
codon of the mRNA (Fig. 1). The 5' end of eukaryotic mRNA is 
capped, te., contains the cap structure m 7 GpppN (7-melhyf 
gua/Kisine-tiiphospr^ Most translation in 

eukaryotes occurs in a cap-dependent fashion, I.e., the cap is 
specifically recognized by the elF4E, 3 which binds the 5' cap. 
The elF4F translation initiation complex is then formed by the 
assembly of elF4E, the RNA heficase elF4A. and eIF4G, a 
scaffolding protein that mediates the binding of the 4 OS ribo- 
somal suburut to the mRNA molecule through interaction with 
the elF3 protein present on the 40S ribosome. elF4A and elF4B 
participate in melting the secondary structure of the 5' UTR of 
the mRNA. The 43S initiation complex (40S/elF2/MeMRNA/ 
GTP complex) scans the mRNA in a 5'-^3' direction until it 
encounters an AUG start codon. This start codon is then base- 
paired to the anticodort of initiator tRNA, forming the 48$ initi- 
ation complex The initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosome. 

Unlike most eukaryotic translation, translation initiation of 
certain mRNAs, such as the picomavirus RNA, is cap inde-' 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require elRE. Either the 43S complex can bind 
the initiation codon directly through interaction with the IRES in 
the 5' UTR such as in the encephatonrryocarditis virus, or it can 
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Fig. 1. Translation initiation in eukaryoles. The 4£-BPs are rryperpnos- 
phorytated to release GIF4E so that a can interact with the 5' cap. and the 
eIF4F initiation complex is assembled. The interaction of pory(A) binding 
protein with the initiation complex and ctrcutarization ol the mRNA is not 
depicted m the diagram. The secondary structure of the 5' UTH is melted, 
the 40S ribosomal subunil is bound to elF3, and the ternary complex 
consisting of elF2. GTP. and the Met-tRNA are recruited to the mRNA. The 
ribosome scans the mRNA in a.5'-*3' direction until an AUG start codon 
is found in the appropriate sequence context The initiation factors are 
released, and the large ribosomal suburul b recruited. 



initially attach to the IRES and then reach the initiation codon by 
scanning or transfer, as is the case with the pc*Bowus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations In the 
expression or phosphorylation, status of the various factors 
involved. Key components in translational regulation that 
may provide potential therapeutic targets follow. 

e»F4E. elF4E plays a central rote in translation regulation. 
It is the least abundant of the initiation factors and is con- 
sidered the rate- limiting component for initiation of cap- 
dependent translation. elF4E may also be involved in mRNA 
splicing, mRNA 3'*processing, anc j m RNA nucleocytoplas- 
mic transport (2). elF4E expression can be increased at the 
transcriptional level in response to serum or growth factors 
(3). elF4E overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



lational machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpression of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth factor-2, and cycBn D1 £2, 8, 9). 

Another mechanism of control is the regulation of elF4E 
phosphorylation. e!F4E phosphorylation is mediated by the 
mitogerv-activated protein kinase- interacting kinase 1 , which 
is activated by the mitc^ervactivated pathway activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mitogen-actfvated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, insulin, 
angiotensin II, sre kinase overexpression, and ras over- 
expression, lead to etF4E phosphorylation (14). The phos- 
phorylation status of elF4E is usually correlated , with the 
transitional rate and growth status of the ceO; however, 
elF4E phosphorylation has also been observed In response 
. to some cellular stresses when translations! rates actually 
decrease (15). Thus, further study is needed to understand 
the effects of elF4E phosphorylation on eIF4E activity. 

Another mechanism of regulation is the alteration of elF4E 
availability by the binding of elF4E to the e!F4E-binding pro- 
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with 
elF4G for a binding site in elF4E. The biding of elF4E to the 
best characterized eF4E-blnding protein, 4E-BP1, is reguv 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 hyperphosphorylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and II, ihterleukm 3, granulorjyte-macrophage colony-stim- 
ulating Jactor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-6P1 and 
to.decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of .nutrients or growth factors results* . 
in 4E-BP1 dephosphorytation, an increase in elF4E binding, 
and a decrease in cap-dependent translation. 

p70 S6 Kinase. Phosphorylation of nbosomal 40S protein 
S6 by S6K is thought to ptay an important role in translational . 
regulation. S6K mouse embryonic ceils proliferate more 
slowly than do parental cells, demonstrating that S6K has a - 
positive influence on cell proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ofio^pyfinrkfine tract (5' TOP) found at the 5' irm of r »Tx)SomaI 
proteb mRNAs and other mRNAs coding for components of 
the translational machinery. Phc«phorytation of S6K is regu- 
lated in part based on the availability of nutrients (1 8, 1 9) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and insulin-tike growth factor I (20). 

elF2a Phosphorylation. The binding of the initiator tRNA 
to the small nbosomal unit is mediated by translation initia- 
tion factor elF2. Phosphorylation of the tr-subunit of elF2 
prevents formation of the elF2/GTP/Met-tRNA complex and 
inhibits global protein synthesis (21, 22). elF2a is phospho- 
rylated under a variety of conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
eIF2a is phosphoryiated by heme-regutated inhibitor, nutrient- 
regulated protein kinase, and the IFN-induced. double- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 
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The mTOR Signaling Pathway. The macrofide antibiotic 
rapamycin (SiraDmus; Wyeth-Ayerst Research, Cotlegevitle, 
PA ) has been the subject of intensive study because H in- 
hawts signal transduction pathways involved in T-ceff activa- 
tion. The rapamycin-sensfflve component of these pathways 
is-mTOR (also. called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR proteins that regulate G, 
progression and translation in response to nutrient avaflabi^ 
ity (24). mTOR is a serine-threonine kinase that modulates 
translation initiation by altering the phosphorylation status of 
4E r BP1 and S6K (Fig. 2; Ref. 25). 

is phosphoryteted on multiple residues. mTOR pbos- 
phorytates the Thr-37 and Thr-46 residues of 4E-BP1 m vitro 
(26); however, phosphorylalion at these sites is not associated 
with a loss of elF4E binding. Phosprwytation of Thr-37 and 
• Thr-46 is required for subsequent phosphorylation at several 
COOH^ terminal, serum-sensitive sites; a combination of these 
Phosphorylation events appears to be. needed to inhibit the 
bincfing of 4E-BP1 to eIF4E £5). The product of the ATM gene, 
p3a/MSK1 pathway, and protein, kinase Co also play a rote in 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein kinase Akt PTEN is a phosphatase that 
negatively regulates P13K signaling. PTEN null cells have 
constituuVery active of Akt, with increased S6K activity and 
S6 phosphorylation (30). S6K activity is inhibited both by 
P13K inhibitors wortmannin and LY2940G2 and by mTOR 
inhibitor rapamycin (24). AM phosphorylates Ser-2448 in 
mTOR in vitro, and this site is phosphorylated upon Akt 
activation in vivo (31^33). Thus, mTOR is regulated by the 
PI3K/AM pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR autophosphorylation is blocked by wort- 
rrannm but not by rapamycin (34). This seeming inconsistency 
suggests that mTOR-responsive regulation of 4E-BP1 and S6K 
activity occurs through a mechamsm other than intrinsic mTOR 
kinase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibilion of a phospha- 
tase. Treatment with cafycufin A, an inhibitor of phosphatases 1 
and 2A, reduces rapamycin-induced dephosphorybtion of 4E- 
BP1 and S6K by rapanrrycin f35). PP2A interacts with full-length 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylation resulting from rapamytia mTOR phosphorylates PP2A 
in vitro; however, how this process atters PP2A activity is not 
known. These results are consistent with the model that phos- 
• Phorytation of a r^hosphatase by mTOR prevents dephospbo- 
rybtion of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rrvation and rapamycin block inhibition of the phosphatase by 
mTOR y 

Poryaderrytation. The poly(A) tail in eukaryotic mRNA is 
important in enhancing translation initiation and mRNA sta- 
bility. Poryadfenybtion plays a key role in regulating gene 
expression during oogenesis and early embryogenesis. 
Some mRNA that are translationaHy inactive in the oocyte are 
potyadenytated concomitantly with translations! activation in 
oocyte maturation, whereas other mRNAs that are transit 
tionalty active during oogenesis are deadenylated and trans- 
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Ftg. 2. Regulation of translation WtiaBon by signal transduction path- 
ways. Signaling via p38, extracellular signal-related kinase; PBK, and 
mTOR can aA activate translation initiation. 



tationatly silenced (36-38). Thus, control of pory(A) tail syn- 
thesis is an important regulatory step in gene expression. 
The 5' cap and poty(A) tail are thought to function synergis- 
ticalry to regulate mRNA translational elficiency (39, 40). 

RNA Packaging. Most RNA- binding proteins are assem- 
bled on a transcript at the time of transection, thus deter- 
mining the transJational fate of the transcript (41). A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to play a role in restricting the recruitment of 
mRNA to the translational machinery (41-43). The major 
rnRNA-associated protein, YB-1, destabilizes the interaction 
of etF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results in translalional repression in vivo (44). Thus, 
alterations in RNA packaging can also play an important rote 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer 
variations in mRNA sequences that increase or decrease trans- 
lational efficiency, changes in the expression or availability of 
components of the translational machinery, and activation of 
translation through aberrantly activated signal transduction 
pathways. The first alteration affects the translation of an ind>- 
vidualmRNA mat may play a rote in carcinogenesis. The sec- 
ond and third alterations can lead to more global changes, such 
as an increase in the overall rate of protein synthesis, and the 
translational activation of several mRNA species. 

Variations in mRNA Sequence 

Variations in mRNA sequence affect the translational effi- 
ciency of the transcript. A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
in the 5' UTR, can aller its translational efficiency, as seen in 
the following examples. 
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c-myc Saito et at proposed that translation of full-length 
c-myc is repressed; whereas in several Burkitt lymphomas 
that have deletions of the mRNA 5' UTR, translation of c-myc 
is more efficient (45). More recently, it was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be initiated by a cap-independent as welt as a 
cap-dependent mechanism (46. 47). In patients with multiple 
myeloma, a C-*T mutation in the c-myc IRES was identified 
(48) and found to cause an enhanced Ntiation of translation 
via internal ribosomal entry (49). 

BRCA1. A somatic point mutation (1 17 G-»C) in position 
-3 with respect to the start codon of the BRCA1 gene was 
identified in a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild- type or mutated 
BRCA1 5 r UTR and a downstream lucif erase reporter dem- 
onstrated a decrease in the translationa! efficiency with the 5' 
UTR mutation. 

. Cyclin-dependent Kinase Inhibitor 24. Some inherited 
melanoma kindreds have a G-*T transversion at base -34 
of cyclin-dependent kinase inhibit or-2A, which encodes a 
cyclin-dependent kinase 4/cyclin- dependent kinase 6 kinase 
inhibitor important in G, checkpoint regulation (51). This- 
mutation gives rise to a novel AUG translation initiation 
codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations in splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, uttirnately impacting translational efficiency. 

ATM. The ATM gene has four noncoding exons in its 5' 
UTR that undergo extensive alternative splicing (52). The 
contents of. 12 different 5' UTRs that show considerable 
diversity in length and sequence have been identified. These 
divergent 5' leader sequences play an important role in the 
translational regulation of the ATM gene. 

mdm. In a subset of tumors, overexpression of the onco- 
protein mdm2 results in enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only in their 5' leaders (53). The longer 5' 
UTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosomes inefficiently compared with 
the short 5' UTR. 

BRCA1. tn a normal mammary gland, BRCA1 mRNA is 
expressed with a shorter leader sequence (5'UTRa). whereas 
in sporadic breast cancer tissue, BRCA1 mRNA is expressed 
with a longer leader sequence (5' UTRb); the translational 
efficiency of transcripts containing 5' UTRb is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-pa TGF-03 mRNA includes a 1.1-kb 5' UTR, which 
exerts an inhibitory effect on translation. Many human breast 
cancer cell lines contain a novel TGF-03 transcript with a 5' 
UTR that is 870 nucleotides shorter and has a 7-fold greater 
translational efficiency than, the normal 7GF-03 mRNA (55). 

Alternate Poryadenylation Sites. Multiple poryadenyl- 
ation signals leading to the generation ol several transcripts 
with differing 3' UTR have been described lor several mRNA 
species, such as the RET proto-oncogene (56). ATM gene 
(52), tissue inhibitor of metaIloproteinases-3 (57). RHOA 



proto-oncogene (58), and calmodulin-l (59). Although the 
effect of these alternate 3' UTRs on translation is not yet 
known, they may be important in RNA-protein interactions 
that affect translational recruitment. The role of these alter- 
ations in cancer development and progression is unknown. 

Alterations in the Components of the 
Translation Machinery 

Alterations in the components of translation machmery can 
take many forms. 

Overexpresssion of elF4E. Overexpression of elF4E 
causes malignant transformation in rodent cells (60) and the 
deregulation of HeLa cell growth (61). Polunovsky ef at. (62) 
found that elF4E overexpression substitutes for serum and 
individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival signaling. 

Elevated levels of elF4E mRNA have been found in a broad 
spectrum of transformed cell lines (63). elF4E levels are 
elevated fa all ductal carcinoma in situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that e)F4E overex- 
pression can be valuable as a prognostic marker. eIF4E over- 
expression was found in a retrospective study to be a marker of 
poor prognosis in stages I to III breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer is now 
under way in a prospective trial (67). However, in a different 
study, elF4E expression was correlated with the aggressive 
behavior of nooHodgkin's lymphomas (68). In a prospective 
analysis of patients with head and neck cancer, elevated tevels 
of elF4E in histologically tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results all suggest tbat'elF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that elF4E overexpression is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations In a 
number of other initiation factors have been associated with 
cancer. Overproduction of e(F4G, similar to elF4E, leads to 
malignant transformation in vitro (69). elF-2a is found in 
increased levels in bronchioloatveolar carcinomas ol the lung 
(3). Initiation factor elF-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit of 
translation initiation factor 3 is amplified and overexpressed 
in breast and prostate cancer (72), and the elF3-p1 10 subunit 
is overexpressed in testicular seminoma (73). The role that 
overexpression of these initiation factors plays on the devel- 
opment and progression of cancer, if any, is not known. 

Overexpression ol S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer ceil line, com- 
pared with normal mammary epithelium (74). In a study by 
Barfund ef at. (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' pofy(A) syn- 
thesis. PAP is overexpressed in human cancer cells com- 
pared with normal and viralfy transformed cells (75). PAP 
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and. in mammary tumor cytosote, was 
found to be an independent factor for predicting survival (76% 
Little is known, however, about how PAP expression or ac r 
tjvity affects the translational profile. 

Alterations in RNA-binding Proteins. Even less is known 
about alterations in RHA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA-bindng protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cefl lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases crtenwesistance by en- 
hancing the transcription of a multidrug resistance gene (80). 



Activation of Signal transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutant in human cancers is the tumor suppressor 
gene P7EN, which leads to the activation of the Ft3K/Akt path- 
way. Activation of PI3K and Akt induces the oncogenic trans- 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (81), 
. A mutant Akt that retains kinase activity but does not phos- 
phorytate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of P.K3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin-like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translational control. 
For example, HER2/neu mRNA is translationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type-independent manner and 
by a distinct ceH type-dependent mechanism that increases 
translational efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, it is possible that 
alterations at the translational level can. in part account for 
the discrepancy between HER2/nev gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
ond Tumstat'm 

Rapamycin inhibits the proliferation of lymphocytes. It was 
initially developed as an immunosuppressive drug for organ 



transplantation. Rapamycin with FKBP 12 (FK50&-binding 
protein, M ( 12,000) binds to mTOR to inhibit its function. 

Rapamycin causes a small but significant reduction in the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap-iraiependent translation (17, 86). 

The rapamyrin-sensfiive signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is now being studied as a target for cancer therapy (87). Pros- 
tate, breast, small cefl lung, glioblastoma, melanoma, and T-cefl 
leukemia are among the cancer fines most sensitive to the 
rapamycin analogue CO-779 (Wyeth-Ayerst Research; Ref. 
87). In rhabo^rnycosarcoma cefl fines, rapamycin is either cyto- 
static or cytoddal, depending on the p53 status of the cell; p53 
wild-type ceBs treated with rapamycin arrest in the 6, phase 
and maintain their viability, whereas p53 mutant ceRs accumu- 
late in G, and undergo apoptosis (88, 89). In a recently reported 
study using human primitive rteur ectodermal tumor and 
meduBoblastoma models, rapamycin exhibited more cytotox- 
icity in combination with cisplatin and camptothecin than as a 
single agent. In vivo, CCi-779 delayed growth of xenografts by 
160% after 1 week of therapy and 240% after 2 weeks. A single 
high- dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in vivo was 1.3 times greater, with 
cisplatin in combination with CCl-779 than with cisplatin alone 

(90) . Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycin analogues CCl-779 and RAD001 (Novartis, 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycin on lymphocyte proliferation, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCl-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in —24 h after therapy 

(91) . The principal toxicities of CCl-779 have included der- 
matotogical toxicity, myetosuppression, infection, mucositis, 
diarrhea, reversible elevations in liver function tests, hyper : 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCl-779 have been conducted in 
advanced renal cell carcinoma and in stage lll/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 96). 
Thus, CCl-779 has documented preliminary clinical activity in 
a previously treated, undetected patient population. 

Active investigation is under way inlo patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCl-779 in PTEN-nuil tumors (30. 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCl-779, although only two of these lines 
lached PTEN (97) There was, bowevei, a positfve cot relation 
between Akt activation and CCl-779 sensitivity {97), This 
correlation suggests that activation of the PI3K-Akt pathway, 
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regardless of whether it is attributable to a PTEN mutation or 
to overexpression of receptor tyrosine kinases, makes can- 
cer cetl amenable to mTOR-drrected therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with raparnycin resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of angiogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a result of mTOR inhfoition in these cells or by 
inhibition of translation of such proangjogenic factors as 
vascular endothelial growth factor in tumor cells (99, 100). 

The angiogenesis inhtoitor tumstatin, another anticancer 
drug currently under study, was also found recently to inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with integrin, tumstatin inhibits activation of the 
P13K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR- signaling pathway. 

Targeting elF2ce £PA, Clotrimazole, mda-7, 
and Flavonoids 

EPA is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2 * from 
intracellular stores while inhibiting their refilling, thereby acr 
tivating PKR. PKR, in turn phosphorylates and inhibits e(F2a, 
resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in vivo, inhibits cell growth through 
depletion of Ca 2 ^ stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cycfins A, E, and Dt, 
resulting in blockage of the cell cycle in G v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mda7) induces apoptosis in marry cancer cells including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of elF2a and. induction of apoptosis (110). 

Flavonoids such as genistein and quercetin suppress tu- 
mor cell growth. AH three mammalian elF2a kinases, PKR, 
heme- regulated inhibitor, and PERK/PEK, axe activated by 
flavonoids, with phosphorylation of e!F2a and inhibition of 
protein synthesis (111). 

Targeting elF4A and elF4E: Antisense RNA 
and Peptides 

Antisense expression of elF4A decreases the proliferation rate 
of melanoma celts (1 12). Sequestration of elF4E by overexpres- 
sion ot 4E-BP1 is proapoptotic and decreases tumorigenicity 
(113, 114). Reduction of eIF4E with antisense RNA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of lumor doubling times (7). Antisense elF4E RNA treat- 



ment also reduces the expression of arrgjogenk: factors (115) 
and has been proposed as a pc^errtialadiu^ therapy 
and neck cancers, particularly when elevated e*F4E is found in 
surgical margins. Small molecute inhibftors that bind the elF4G/ 
4&BFn-binding domain of elF4E are proapoptotic (116} and 
are aJso being actively pursued. 

Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5* UTR These mRNA would thus be at a competitive 
disadvantage in norma! ceils and not translate wen, whereas in 
cancer cells, they would translate more efficiently. For example, 
the intrxxiuction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex vims type- 1 thymidine 
kinase gene, allows for selective translation of herpes simplex 
virus type-1 thymidine kinase gene in breast cancer cell fries 
compared with jx>rrnal mammary ceD lines and results in se- 
lective sensitivity to ganciclovir (117). 

Toward the Future 

Translation is a crucial process in every eel However, several 
alterations in transitional control occur in cancer. Cancer cells 
appear to need an aberrantly activated transfational state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the transfational ma- 
chinery, such as efF4E, and signal transduction pathways in- 
volved in translation initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some pre&ninary activity in clinical trials. It is possible 
that with the deyeloprnent of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to achieve clinical utility in 
combination therapy. In the interim, pur increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 
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OverexpressioQ of cheniokiues, fibrogenic cytokines, and myofibroblasts in 
human membranous nephropathy, 

Mczzano SA , D rogue tt MA, Burgos ME , Ardiles LG , Aros CA . Caorsi I , Egido J . 

Division of Nephrology, School of Medicine, Universidad Austral, Valdivia, Chile. 
sme2zan0@uach.cl 

Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in human 
membranous nephropathy. BACKGROUND: Proteinuria plays a central role in the 
progression of glomerular disease, and there is growing evidence suggesting that it may 
determine tubular cell activation with release of chemokines and fibrogenic factors, 
"leading to interstitial inflammatory reaction. However, most studies on this subject have 
been performed in experimental models, and the experience in human kidney biopsies 
has been scarce. We analyzed the tissue sections of patients with idiopathic membranous 
nephropathy (IMN), a noninflammatory glomerular disease that may follow a progressive 
disease with heavy persistent proteinuria, interstitial cell infiltration, and decline of renal 
function. METHODS: Paraffin-embedded biopsy specimens from 25 patients with IMN 
(13 progressive and 12 nonprogressive) were retrospectively studied by 
immunohistochemistry [monocyte chemoattractant protein- 1 (MCP- 1 ), regulated on 
activation normal T-cell expressed and secreted chemokine (RANTES), osteopontin 
(OPN), platelet-derived growth factor-BB (PD-GF-BB)] and in situ hybridization [MCP- 
I, RANTES, PDGF-BB, transforming growth factor-beta! (TGF-betal)]. Moreover, we 
studied the presence of myofibroblasts, which were identified by the expression of alpha- 
smooth muscle actin (alpha-SMA), the monocytes/macrophages (CD68-positive cells), 
and T-cell infiltration (CD4+ and CD8+ cells). All of the patients were nephrotic and 
without treatment, at time of the biopsy. RESULTS: A strong up-regulation of MCP-l , 
RANTES, and OPN expression was observed, mainly in tubular epithelial cells, with a 
significant major intensity in the progressive IMN patients. A strong correlation between 
the mRNA expression and the corresponding protein was noted. The presence of these 
chemokines and OPN was associated with interstitial cell infiltration. TGF-beta and 
PDGF were also up-regulated, mainly in tubular epithelial cells, with a stronger 
expression in the progressive IMN, and an association with the presence of 
myofibroblasts was found. CONCLUSIONS: Patients with severe proteinuria and 
progressive IMN have an overexpression in tubular epithelial cells of the chemokines 
MCP-1, RANTES, and OPN and the profibrogenic cytokines PDGF-BB and TGF-beta. 
Because this up-regulation was associated with an interstitial accumulation of 
mononuclear cells and an increase in myofibroblast^ activity, it is suggested that those 
mediators are potential predictors of progression in IMN. Finally, based on experimental 
. data and the findings of this article, we speculate that severe proteinuria is the main factor 
responsible for the up-regulation of these factors in tubular epithelial cells. 



Decreased uacoupling protein expression and intramyocytic triglyceride 
depletion m formerly obese subjects. 
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r^ CTIVE: T ° eXami " e the muscular uncoupling protein expression 2 (UCP2) and 

g^e expression in morbid obese subjects before and after bariatric surgery rbilio- 
P ancreaticdivers,on(BPD)]. RESEARCH" METHODS AND PROCEDURE?- meve 

BPdISSI (BM ! = rt™ 2 kg/m(2>) W6re StUdi6d before BPD - d 2 « ™nte*Z 
BPD. Skeletal muscle UCP2 and LTCP3 mRNA was measured using reverse 

transenptese-competitive polymerase chain reaction and UCP3 protein by Western 
blotting Intrarnyocyt.c triglycerides were quantified by high-performance liquid 
3^^^' Twent y- four - hour e^rgy expenditure and respiratory quotient (RQ) 
were measured m a respiratory chamber. RESULTS: After BPD, the average weLhUoss 

TowT^I, * N ° npr0tein RQ W3S inCreased in * e P^bese subject 
1 0 ?6 to I C0 + , n oa ' ^ deride level dropped (3 66 

UC?2 and UCP3 mRNA °!f ^ P < 0 -°° 0l) after BPD of 

UCP2 and UCP3 mRNA was sigruficandy reduced (from 35.9 +/- 6. 1% to 18 6 +/- 4 5% 

U^oTot' P ?f ^ & 7 6 °- 2 +/ " I4 " 0% t0 33 4 +/ " 8 5% > P =° respeetivelyf 
fin? ? P .? r m _ 00,,tent was 3180 significantly reduced (272.19 +/- 84.13 vs 175 78 +/ 

levels ^ 'n P nn?; 04) ' A ""Y? re 8 ression ^ysis (R(2) = 0.90) showed that CMTG 
urX ? " t ^^ tCd mOSt p ° werM "^pendent variable for predicting 
SSST DrSC "\ S,0hr: The Stro "g «i«*-ion of UCP expression^ decrease 
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Differential expression of the short and long forms of the gamma 2 subnnit 
of the GABAA/benzodiazepine receptors. 

Miralles CP , Gutierrez A . Mian ZU . Vicorica J , De Bias AL . 

Division of Molecular Biology and Biochemistry, School of Biological Sciences, 
University of Missouri-Kansas City 641 10-2499. 

The distribution of the mRNAs encoding the gamma 2S and gamma 2L subunits of the 
GABAA receptor in the rat brain has been revealed by in situ hybridization, northern blot 
and dot blot analysis using specific antisense oligonucleotides. In addition, the 
quantitative distribution of the gamma 2S and gamma 2L subunit peptides participating in 
the fully assembled GABAA recepto^enzodiazepine receptors has been mapped by 
immunoprecipitation with specific anti-gamma 2S and anti-gamma 2L antibodies. 
Several neuronal types and brain regions are enriched in gamma 2L such as neurons of 
the layer It of striate cortex and cerebellar Purkinje cells as well as the inferior colliculus, 
superior colliculus, deep cerebellar nuclei, medulla and pons. Other neuronal types and 
regions are enriched in gamma 2S such as the mitral cells of the olfactory bulb, pyramidal 
neurons of the pyriform cortex, layer VI of the neocortex, granule cells of the dentate 
gyrus and pyramidal cells of the hippocampus. Other cortical areas and cerebellar granule 
cells express both gamma 2S and gamma 2L in comparable amounts: There is a good 
correlation between the relative expression of gamma 2S and gamma 2L mRNAs and the 
relative presence of these protein subunits in folly assembled and mature receptors in the 
studied brain regions. The differential distribution of gamma 2S and gamma 2L might 
result in differential ethanol sensitivity of the neurons expressing these GABAA receptor 
subunits. 
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Follicle-Stimulating Hormone Receptor and Its Messenger Ribonucleic Acid Are 
Present in the Bovine Cervix and Can Regulate Cervical Prostanoid Synthesis 1 

D. Mizrachi and M. Shemesh 2 

Department of Hormone Research, Kimron Veterinary Institute, Bet Dagan, Israel 50250 



ABSTRACT 

The hypothesis that FSH regulates the bovine cervical pros- 
taglandin E 2 (PCE 2 ) synthesis that is known to be associated with 
cervical relaxation and opening at the time of estrus was inves- 
tigated. Cervical tissue from pre-estrous/estrous, luteal, and 
postovulatory cows were examined for 1) the presence of bovine 
(b) FSH receptor (R) and its corresponding mRNA and 2) the 
effect of FSH on the PGE 2 regulatory pathway in vitro. The pres- 
ence of bFSHR mRNA in the cervix (maximal during pre-estrus/ 
estrus) was demonstrated by the expression of a reverse tran- 
scription (RT) polymerase chain reaction (PCR) product (384 
base pairs) specific for bFSHR mRNA and sequencing. Northern 
blotting revealed three transcripts (2.5, 3.3, and 3.8 kilobases 
[kb]) in cervix from pre-estrous/estrous cows. The level of FSH R 
(75 kDa) was significantly higher (p < 0.01) in Western blots of 
pre-estrous/estrous cervix than in other cervical tissues. There 
was a good correlation between the 75-kDa protein expression 
and its corresponding transcript of 2.55 kb throughout the es- 
trous cycle as described by Northern blot analysis as well as RT- 
PCR. Incubation of FSH (10 ng/ml) with pre-estrous/estrous cer- 
vix resulted in a 3-fold increase in the expression of FSHR and 
a 2-fold increase in both G protein (a s ) and cyclooxygenase II. 
| FSH (5-20 ng/ml) significantly increased (p < 0.01) cAMP, ino- 
sitol phosphate (p < 0.01), and PGE 2 (p < 0.01) production by 
pre-estrous/estrous cervix but not by cervix at the other stages. 
We conclude that bovine cervix at the time of the peripheral 
plasma FSH peak (pre-estrus/estrus) contains high levels of 
FSHR and responds to FSH by increasing the PGE 2 production 
responsible for cervical relaxation at estrus. 

INTRODUCTION 

FSH and LH are the two known pituitary hormones that 
regulate gonadal functions. FSH is essential for female fer- 
tility, specifically for folliculogenesis, and human ovarian 
failure has been related to mutations both in the FSH re- 
ceptor (FSHR) and in the FSHp gene [1]. However, unlike 
LH, for which extra-gonadal binding has been documented 
for the uterus in the cow [2], pig [3], and human [4], no 
extra-gonadal effects of FSH have been reported. The uter- 
ine LH receptors have been shown to be physiologically 
active in the cow [2, 5]. The concentration of uterine LH 
receptors is related to the phase of the cycle, and binding 
of LH to uterine receptors increases uterine prostaglandin 
synthesis [2, 5]. Preliminary observations in our. laboratory 
indicated that both FSH and LH receptors are also present 
in the cervix; We therefore investigated whether FSHRs are 
associated with the regulation of cervical prostaglandin, 
particularly around the time of estrus, when both cervical 
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PGE 2 production [6] and peripheral FSH concentrations are 
maximal. 

The present investigation was therefore conducted to de- 
termine the presence of FSHR and its mRNA, and its physr 
iological activity, in the bovine reproductive tract. Since the 
cervix is a prostaglandin E x (PGEj)- and PGE 2 -sensitive 
tissue and responds to hormonal stimulation, we also de- 
termined whether FSH could increase cervical cyclooxy- 
genase expression and production of PGE 2 in vitro, partic- 
ularly at the time near estrus. 

MATERIALS AND METHODS 

Animals 

Cervical and ovarian tissues were collected from Hol- 
stein cows at a local abattoir. The stage of the cycle was 
determined according to signs of ovulation and the status 
of corpora lutea (CL; "weight, color). These parameters al- 
lowed the classification of the cervix into three groups: 1) 
pre-estrous/estrous (follicular, 18-20 days postovulation; 
regressed CL < 1.0 g; presence of cervical mucus; n = 
54); 2) postovulatory (1-4 days postovulation; presence of 
corpora hemorrhagica; signs of follicular rupture; n = 35), 
and 3) luteal (12-16 days postovulation; CL 4-6 g; n = 
44). Bovine granulosa cells were collected from pre-es- 
trous/estrous follicles and used as a positive control for 
FSHR expression [7]. 

The cervix external os segment was taken as the initial 
3 cm of the cervix (total length of cervix is about 9 cm). 
The cervical os segments, primarily luminally oriented 
muscularis with epithelium, were sliced into horizontal 
strips (1 cm in length), pooled, and minced finely with a 
scalpel. The Kimron Veterinary Institute Animal Care and 
Use Committee approved . all procedures. 

FSHR Cene Expression 

RNA isolation. Bovine cervical or ovarian tissues were 
rapidly dissected and frozen in liquid nitrogen (within 20 
min after slaughter). Total RNA was extracted using the 
acid phenol, guanidine thiocyanate technique [8], using 
TriReagent (Molecular Research Center, Inc., Cincinnati, 
OH) according to the manufacturer's instructions. 

Reverse transcription (RT)-polymerase chain reaction 
(PCR). Bovine FSHR mRNA was detected as previously 
described for the LH receptor mRNA [5]. Oligonucleotides 
corresponding to the published sequence of the bovine 
FSHR [7] were synthesized and used as primers in a PCR 
reaction. Bovine ovarian and cervical tissue cDNA ob- 
tained by RT of 2 |xg RNA was used as a template in the 
PCR reaction using a pair of primers corresponding to the 
transmembrane segments of the FSHR. The primers, 20- 
mer each, were selected using the OL1GO Program (Oligo 
Rl primer analysis software; National Biosciences, Plym- 
outh, MN). The forward primer corresponded to position 
1399-1419 (5 ' CGGCTTTTTC ACTGTCTTTG3 ') on the 
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bovine FSHR mRNA [7]. The reverse primer was a 20-mer 
oligonucleotide complementary to position 1763-1783 
(5'CGCTTGGCTATCTTGGTGTC3'). The predicted size 
of the RT-PCR product was 384 basepairs (bp). Four mi- 
croliters of the RT reaction were used as a template for 
PCR; the reaction was allowed to proceed 35 cycles using 
2 U Termus Aquaticus DNA polymerase (recombinant) 
provided by MBI Fermentas, Vilnius, Lithuania, and 200 
pmol of each primer. The cycling parameters of the PCR 
were 98°C for 20 sec, 56°C for 45 sec, and 72°C for 45 
sec. After amplification the samples were separated' on a 
1% agarose gel, stained with ethidium bromide, and pho- 
tographed under UV light. The 384-bp fragment was ex- 
tracted using the Wizard PCR Preps Kit (Promega, Madi- 
son, WI) and sequenced with the upper primer using an 
automatic sequencer (Applied Biosystems, Foster City, 
CA). For internal control, a bovine p-actin fragment of 890 
bp was produced from an upper primer (5'ACCA- 
ACTGGG ACG AC ATGGAG3 ' ; 21 mer) and a lower prim- 
er (5 ' GC ATTTGCGGTGG AC A ATGGA3 ' ; 21 mer) as pre- 
viously described [5]. Each PCR amplification was stan- 
dardized using ovarian granulosa cDNA to produce a spe- 
cific band of the predicted size. Standardization was based 
on temperature, cycle number, Mg level, and pH. 

Western Blots 

Cervical, bovine seminal vesicle, or muscle minces were 
sonicated for 45 sec, lysed in 500 pi lysing buffer (1% 
Nonidet-40 [Sigma, St. Louis, MO], 2 mM EGTA, 5 nM 
MgCl ? , 1 mM PMSF in PBS) and incubated for 2 h at 4°C. 
After centrifugation, aliquots were taken for protein deter- 
mination using gamma globulin (Sigma) as reference stan- 
dard [9] and a protein dye binding method (Bio-Rad Lab- 
oratories, Richmond, CA). Sixty micrograms of protein of 
the soluble tissue extract was then separated by electropho- 
resis on 5-9%-gradient SDS-PAGE and electrob lotted onto 
nitrocellulose paper (Amersham, Little Chalfont, Bucks, 
UK) as described previously [9]. The nitrocellulose mem- 
brane was washed with PBS containing 0.05% TWEEN-20 
and blocked with 10% horse serum (Kimron Veterinary In- 
stitute) in washing solution. The nitrocellulose membrane 
was then treated with either 1) an anti-peptide antibody 
(code name 179) raised against human (h) FSHR peptide 
265-295, diluted 1:500 (donation of Dr. J.A. Dias, Wads- 
worth Center, New York State Department of Health); 2) 
rabbit antiserum for bovine G protein (a s ) diluted 1:1000; 
UBI, Lake Placid, NY); or 3) rabbit anti-bovine cycloox- 
ygenase polyclonal antiserum (diluted 1:200; Kimron Vet- 
erinary Institute) [9]. Different dilutions were used for each 
antibody, and the final dilution used was 75% of the dilu- 
tion, which gave a maximal signal as previously described 
[9]. The nitrocellulose paper was then incubated with horse- 
radish peroxidase-conjugated goat anti-rabbit IgG (Sigma 
Israel, Rehovot, Israel; diluted 1:2000 in washing buffer) 
for 1 h at room temperature. The presence of FSHR, G 
protein (a s ), or cyclooxygenase was then visualized by 
means of a color reaction as follows. The nitrocellulose 
paper was incubated in a substrate solution containing 3'3'- 
diaminobenzidine (0.5 mg/ml; Sigma) in a mixture of PBS 
containing 0.5% CaCl 2 and 6% H 2 0 2 . The antibody to 
hFSHR recognized the 75-kDa protein of the FSHR. The 
antibody to G (a s ) recognized both the 42-kDa and 87-kDa 
forms of this protein, and the antibody to cyclooxygenase 
recognized the 72-kDa form (cyclooxygenase 11). Extract 
of bovine seminal vesicles that are known to have a high 



content of cyclooxygenase was used as positive control as 
previously described [9]. The densitometric scans were ob- 
tained using a bio-imaging system (B.I.S. 2020; Rhenium 
Dingo, Jerusalem, Israel) and processed with Tina 2.0 soft- 
ware (Fuji, Japan). Linearity of detection [9] was deter- 
mined for densitometry for both Western and Northern 
blots. Each Western blot was evaluated in the absence of 
the first antibody, and no signal was detected. 

Northern Blot Analysis 

For Northern blots, 10 pg of total RNA was denatured 
in 20 pi 50% formamide/2.2 M formaldehyde in single- 
strength 3-(»-morpholino) propanesulfonic acid (MOPS) 
buffer (0.04 M MOPS, 10 mM sodium acetate, 1 mM 
EDTA) at 65°C for 10 min. Samples were placed on ice, 
and 5 pi of loading buffer (0.5% SDS, 0.25% bromophenol 
blue, 25% glycerol, 25 mM EDTA) was added. Samples 
were subjected to electrophoresis through a 1.2% agarose 
formaldehyde gel and transferred by capillary blotting to 
nylon membranes (Nytran; Schleicher and Schuell, Keene, 
NH). RNA was subsequently UV cross-linked to mem- 
branes. The RT-PCR product fragments of 384 bp for 
FSHR and 890 bp for 3-actin were used to generate bioti- 
nylated probes using random primer biotin labeling of DNA 
for chemiluminescence (NEBIot phototype kit; Biolabs, 
Beverly, MA) according to the manufacturer's instructions. 
The blotted membrane was prehybridized at 65°C for 1 h 
in prehybridization solution (6-strength SSC [3 M NaCl, 
0.3 M sodium citrate], 5-strength Denhardt's reagent [1%. 
ficoll, 1% polyvinylpyrrolidone, 1% BSA], 0.5% SDS, and 
100 p,g/ml denatured salmon sperm DNAs) and hybridized 
overnight in the same solution with the denatured biotiny- 
lated probe to the target RNA. The membrane was then 
washed in double-strength SSC, 0.1% SDS at room tem- 
perature for 10 min and washed again in 0.1-strength SSC, 
0.i% SDS at 68°C for 15 min followed by subsequent 
chemiluminescence, development, and detection on x-ray 
film (RX; Fuji Film, Tokyo, Japan). To obtain quantitative 
data for the specific mRNA, we used the densimetric value 
for (3-actin mRNA to normalize each specific mRNA value. 
To remove FSHR probe, membranes were stripped using 
water for 15 min, then incubated in 0.4 M NaOH, 0.1% 
SDS at 80°C for 30 min. The membrane was rinsed again 
in 0.2 M Tris-HCl, 0.1-strength SSC for 30 min at 25°C. 
After stripping, prehybridization and hybridization with (3- 
actin probe were performed as described for FSHR mRNA. 

Cyclic AMP Determinations 

Cervical tissue minces (50 mg/ml) were incubated for 
10 min in wells of 1.5 ml containing 1.0 ml of Tissue Cul- 
ture Medium- 199 (TCM-199 without serum (Biological In- 
dustries, Beit Haemek, Israel) in the absence or presence 
of physiological (5, 10 ng/ml) and pharmacological doses 
(20, 40 ng/ml) of FSH (USDA-bFSH-8-1; no further in- 
crease in cAMP production was seen at 50 or 100 ng/ml) 
or forskolin, a stimulator of adenylate cyclase (10 |xM; Sig- 
ma). The dose of forskolin was selected to give a 3-fold 
increase in cAMP. At the end of the incubation, tissues were 
removed, blotted on fitter paper to remove mucus, and in- 
cubated overnight at 4°C with 400 jxl of 3% HC10 4 . The 
solution was then neutralized with 150 jxl of KHC0 3 and 
centrifuged, and 50-|xl aliquots were taken for the radiore- 
ceptor assay as described by Brown et al. [10] and modified 
for endometrium by Miyazaki et aL [11]. Standards (0- 
1000 pg of cAMP) or samples were incubated at room tern- 
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FIG. 1 . RT-PCR amplification of bovine cervix FSHR and 0-actin mRNA. 
RT-PCR was performed as described in Materials and Methods for FSHR 
and 0-actin. Separate RT-PCR amplification products of FSHR (estimated 
384 bp) and f^actin (estimated 890 bp) were combined and separated on 
agarose gel electrophoresis and stained with ethidium bromide. RT-PGR 
was performed using 2 ng RNA of bovine granulosa cells as a positive 
control (lane 1), cervical tissue at the pre-estrous/estrous (lanes 3-4), post- 
ovulatory (lanes 5-6), and luteal phases (lanes 7-8). Lanes 9-1 0 represent 
reamplification, using as template the PCR product, cDNA; from the lu- 
teal-phase amplification. Lane 2 represents negative control in the ab- 
sence of RNA. The tissue was taken from 4 cows at each stage of the 
estrous cycle, and two representative samples from each stage are shown. 
SM, size markers. 



perature with 100 \i\ of cAMP binding protein prepared 
from bovine adrenal extracts as described [10]. The buffer 
used for the assay was added to form a final reaction vol- 
ume of 400 pi After 2-h incubation in a cold room, the 
reaction was stopped by adding 300 jxl of a charcoal-dex- 
tran solution; centrifugation followed, and the supernatant 
was removed for counting in a scintillation counter. The 
sensitivity of the assay was 60 pg/tube. Quadruplicate as- 
says were made for each value determined. The within- 
assay and between-assay coefficients of variance were 8% 
and 10%, respectively. 



1401 

BTFSH GCTTTTTCAC TGTCTTTGCC AGTGAGCTCT CAGTCTACAC TCTGACCGCC 
Cervix GCTTTTTCAC TGTCTTTACT AGTGAGCTCT CAGTCTACAC TCTGACCGCC 

BTFSH ATCACGCTGG AAAGATGGCA TACCATCACC CATGCCATGC AGCTCGAATG 
Cervix ATCACGCTGG AAAGATGCCA TCCCATCACC CGTGCCATGC AGCTCGAATG 

BTFSH CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 
Cervix CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 

BTFSH TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 
Cervix TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 

' BTFSH AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 
Cervix AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 

BTFSH CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 
Cervix CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 

BTFSH GTGGCTGCTA CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 
Cervix GTGGCTXCTT CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 

1761 

BTFSH TCCTCTTCTA GTG 
Cervix TCCTCTTCTA GTG 

FIG. 2. Automated nucleotide sequencing and homology between Bos 
taurus mRNA FSHR (CenBank accession number L22319) from nucleo- 
I tides 1401 to 1 763 and the PCR products obtained using the upper primer 
as described in Materials and Methods. A homology of 97.5% was found 
between Bos taurus and our amplified cDNA obtained by RT-PCR from 
cervical mRNA at pre-estrus/estrus, suggesting that they are complemen- 
tary. 



Measurement of Phospholipase C (PLC) Activity 

Cervical minces (30 mg) were incubated in 1 ml TCM- 
199 containing 5 mCi [ 3 H]myo-inositol for 90 min. Slices 
were rinsed with 1 ml TCM-199 and incubated for another 
60 min to remove unincorporated [ 3 H]myo-inositol. At the 
end of the preincubations, slices were incubated in 1 ml 
TCM-199 containing 10 mM LiCl to inhibit inositol phos- 
phate (IP) hydrolysis, allowing 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 to 
accumulate in the tissue after stimulation of PLC, in ab- 
sence or presence of 0-20 ng/ml of FSH or 0-20 ng/ml 
LH (USDA bLH-I-1). Incubations were terminated by add- 
ing 2 ml ice-cold chloroform/methanol/hydrochloric acid 
(5:10:0.1, v:v:v) and then 1 ml chloroform and 1 ml EDTA 
(5 mM) and extracted as described by Kisielewska et al. 
[12]. 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 were separated from the 
labeled inositol compounds by using Dowex AG 1-X8 (for- 
mate form). 3 H-Labeled IP was eluted from the column by 
sequential elution into 10 fractions. Aliquots of each frac- 
tion were added to scintillation vials containing 5 ml scin- 
tillation fluid, and radioactivity was determined using a 
scintillation counter. Total activity of PLC is expressed as 
cpm of total IPs/30 mg/30 min. 

RIA for PCE 2 and PCF, a 

Aliquots of 100 jxl were taken at the end of the incu- 
bation period for specific RIA of PGE 2 and PGF 2a , which 
were performed without chromatographic separation. The 
antisera for PGF 2a (Sigma) reacts preferentially with PGF 2u 
but cross-reacts with PGF la (60%) and to a negligible ex- 
tent (< 0.1%) with prostaglandins of the A, B, and E series. 
The antisera to PGE 2 (Sigma Israel) reacts preferentially 
with PGE 2 but cross-reacts with PGE, (20%), PGA,, PGA 2 , 
PGF la , and PGF 2n (< 10%) and to a negligible extent with 
PGB! and PGB 2 (< 0.1%). The intraassay coefficients of 
variation were 9% and 11%, and the interassay coefficients 
of variance were .12.3% and 13% for PGF 2ft and PGE 2 . 

Statistical Analysis 

ANOVA was performed with a significance level of p 
< 0.05. Data were further analyzed using Tukey's proce- 
dure (p < 0.05 or p.< 0.01) to assess significance between 
treatments. Student's /-test was used where appropriate. 
Values are expressed as means ± SEM. 

RESULTS 

FSHR mRNA Expression in the Cervix 

RT-PCR demonstrated that the FSHR gene is expressed 
in bovine cervix as indicated by the presence of the 384- 
bp fragment corresponding to bFSHR mRNA. The mRNA 
was maximally expressed in the cervix during the pre-es- 
trous/estrous phase. In contrast, the expression of the gene 
was reduced in postovulation cervix and not detectable in 
luteal-phase cervix (Fig. 1). However, reamplification of the 
PCR product, cDNA, from the luteal phase produced a de- 
tectable signal (Fig. 1). 

To demonstrate that the absence of FSHR mRNA in lu- 
teal-phase cervix was specific, bovine (3-actin mRNA was 
used as internal control. PCR demonstrated that the specific 
890 bp of the p-actin cDNA band was produced by the 
cervical tissues of all the stages. The band for p-actin was 
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FIG. 3. Northern blot for analysis of total RNA (10 jxg) extracted from 
bovine cervical tissues collected from pre-estrous/estrous, postovulatory, 
and luteal-phase cows, and bovine granulosa cells as a positive control. 
Relative migration of 28S and 1 8S ribosomai RNA is indicated. To localize 
mRNA for FSHR and 0-actin, our RT-PCR products (384-bp and 890-bp 
fragments, respectively) were used to generate biotinylated probes using 
biotin random primer. Figure shows results for FSHR mRNA in bovine 
granulosa cells as a positive control (lane A); and in cervical tissues at 
pre-estrus/estrus (lanes B-C), postovulation, (lanes D-EJ and the luteal 
phase (lanes F-G). The corresponding fi-actin band was obtained by re- 
probing of the membranes previously tested for FSHR mRNA as described 
in Materials and Methods. Columns show mean ± SEM of the optical 
density of the ratio FSHR:0-actin, in which the luteal phase is given an 
arbitrary value of 1.0. "Significantly different from luteal-phase value (p 
< 0.01); n - 5 for each stage. 




FIG, 4. Presence of FSHR protein in cervical tissues as determined by 
Western blot. Soluble cell extracts (60 jig protein) were used to determine 
the amount of FSHR as described in Materials and Methods. Western blot 
proteins were transferred to nitrocellulose paper and probed with anti- 
serum to peptide hFSHR, 265-296. Western blot displays a representative 
result for bovine granulosa cells, used as positive control (lane A); bovine 
seminal vesicles (lane B) and muscle, used as a negative control (lane C); 
and cervical tissues from three stages of the estrous cycle: pre-estrous/ 
estrous (lane D), postovulatory (lane E), and luteal (lane F). Histogram 
summarizes the results for the three stages of the estrous cycle. Columns 
show mean ± SEM of the relative optical density in which the luteal phase 
is given an arbitrary value of 1 .0. "Significantly different from luteal-phase 
value (p < 0.01 ); n = 7 for each stage. 



present even though the 384-bp band corresponding to the 
FSHR was absent (Fig. 1). 

The nucleotide sequence (Fig. 2) of the 384-bp fragment 
obtained from pre-estrous/estrous cervix was compared 
with that for the Bos taums FSHR reported by Houde et 
al. [7]. It was found that there was a 97.5% homology be- 
tween the 384-bp fragment and the nucleotide range 1410— 
1763 found in the Bos taurus FSHR. 

The Northern blot contained the expected three tran- 
scripts of 2.55, 3.3, and 3.8 kilobases (kb) present in the 
bovine granulosa. However, the high (6.8-kb) and low (1.6- 
kb) transcripts found in the ovary were not found in the 
cervix. The three transcripts were found in pre-estrous/es- 
trous cervix, but only one transcript (2.55 kb) was seen in 
luteal cervix (1/8 of the pre-estrus/estrus level; n = 5) or 
postovulatory cervix (1/3 of the pre-estrus/estrus level; n = 
5) (Fig. 3). 
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FIG. 5. Effect of incubation of cervical tissues with FSH on the expres- 
sion of the FSHR protein. Cervical tissue from 5 pre-estrous/estrous cows 
"was incubated in the absence or presence of FSH (0 or 1 0 ng/ml). Western, 
blot is representative for FSH protein concentration after 3 h of incuba- 
tion, with granulosa cell extract serving as positive control. Lane A, Gran- 
ulosa cells (GO; lane B, cervical tissue at Time 0; lane C, cervical tissue 
after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 h of 
incubation with 10 ng/ml of FSH. Lane E represents muscle tissue incu- 
bated with FSH (10 ng/ml) for 3 h as a negative control. Histogram sum- 
marizes the results of six experiments. Columns are means ± SEM. ♦Sig- 
nificantly different (p < 0.01) from the column representing the control 
at 3 h (a given value of 1). 



FSHR Protein in the Cervix 

The presence of FSHR protein in the cervix was dem- 
onstrated as determined by Western blot analysis using a 
specific antipeptide antibody for hFSHR. The antibody rec- 
ognized a 75-kDa protein in both bovine granulosa and 
cervical preparations identical with the predicted molecular 
size of FSHR. The signal for the 75-kDa protein was stron- 
gest in pre-estrous/estrous cervix when compared with 
postovulation- (3-fold) or luteal-phase (6-fold) cervix (Fig. 
4). With the use of 60 ng protein in each lane, this protein 
was observed to be expressed throughout the estrous cycle. 

In Vitro Effect of FSH on Induction of FSHRs 

A time-course (0.75, 1.5, and 3 h) and dose-response (0, 
10, and 20 ng/ml) study was carried out to determine in 
vitro effect of FSH on. its own receptor. Cervical minces 
(100 mg) from cows in the pre-estrous/estrous, postovula- 
tory, and luteal phases were used. It was found, as deter- 
mined by Western blot, that FSH induced its own receptor 
in cervical tissues at pre-estrus/estrus, with a 3-fold increase 
O < 0.01) observed at 10 ng/ml after 3 h of incubation 
(Fie. 5). However, no effect of FSH was seen at earlier 




FIG. 6. Effect of incubation of cervical tissues with FSH on the synthesis 
of G protein. Cervical tissues from 5 pre-estrous/estrous cows were in-, 
cubated in the absence or presence of FSH (10 ng/ml). Both the 87-kDa 
G protein (a, 0, y complex) and 42-kDa protein (a,) are elevated at 3 h 
of incubation. Granulosa cell extract was used as a positive control. Lane 
A, Granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, cervical 
tissue after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 
h of incubation with 10 ng/ml of FSH. Columns are means ± SEM of 42- 
kDa protein. 'Significantly different (p < 0.05) from the column repre- 
senting the control at 3 h (with an arbitrary value of 1 .0). 



times at the 10 ng/ml level. When 20 ng/ml was used, a 
stimulatory effect (2-fold) was seen after 1.5 h. No signif- 
icant response was observed when tissues of the postovu- 
latory or luteal phase were used at any dose or time tested 
(data not shown). 

Effect of FSH Treatment on C Protein (a J 

A time-course and dose-response study was carried out 
to determine whether cervical FSHR was coupled to G pro- 
tein. Cervical minces (100 mg) from cows in the pre-es- 
trous/estrous, postovulatory, and luteal phases were incu- 
bated in the presence of FSH (0-20 ng/ml) for 0.75, 1.5, 
and 3 h. It was found in pre-estrous/estrous cervical tissue 
that, in the presence of FSH (10 ng/ml), there was a sig- 
nificantly (p < 0.05) higher level of G protein (a s ) at the 
level of both the 87-kDa (a, p, 7 complex) and the 42-kDa 
(a s ) proteins after 3 h of incubation (Fig. 6). FSH, at either 
10 or 20 ng/ml, did not have a stabilizing effect on the 
level of G protein (a s ) at shorter times of incubation (0.75 
or 1.5 h). No significant response was observed when tis- 
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FIG. 7. Effect of FSH on IP production in cervical tissue. Cervical tissues 
from six cows at each of three stages of the cycle were incubated for 30 
min in the absence or presence of FSH (10 ng/ml) or LH (ng/ml). IPs 
produced were determined in four replicates by amount of labeled com- 
pound (cpm) produced. Columns are means ± SEM (*p < 0.05; **p < 
0.01 ; significantly different from control value for each stage of the cycle). 



sues of the postovulatory or luteal phase were used at any 
dose or time tested (data not shown). 

IP Pathway Activation by FSH 

For IP measurement, cervical tissue minces (30 mg) 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated for 30 min in the absence or presence 
of FSH (10 ng/ml) or LH (10 ng/ml). FSH significantly {p 
< 0.01) increased the level of IP, IP 2 , and IP 3 at pre-estrus/ 
estrus. However, FSH significantly (p < 0.05) inhibited IP 
level in the postovulatory stage. In contrast, LH, but not 
FSH, stimulated IPs significantly O < 0.01) at the luteal 
phase (Fig. 7). 

Adenylate Cyclase Activation by FSH 

A dose-response analysis was used to determine whether 
the cervical FSHR was coupled to adenylate cyclase. In 
these experiments, cervical tissue minces (50 mg) obtained 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated with FSH (0-40 ng/ml). FSH in- 
creased cAMP accumulation in a dose-dependent manner 
when incubated with pre-estrous/estrous cervical tissues, 
with maximal stimulation 2.5 times (p < 0.01) that of con- 
trol in the presence of 10 ng/ml (Fig. 8 A). In contrast, cer- 
vical tissue from luteal or postovulation did not respond to 
FSH even though tissues from all phases of the cycle re- 
sponded to forskolin with a significant {p < 0.01) increase 
in cAMP (Fig. 8B). 

Effect of FSH on PGE Production by Cervical Tissues 

A dose-response analysis was carried out to determine 
whether FSHR was associated with cervical prostaglandin 
production. PGE 2 and PGF 2u production by cervical minces 
(100 mg/ml) in the presence of FSH (0-20 ng/ml) was de- 
termined by R1A. FSH stimulated (p < 0.01) basal PGE 2 
production in a dose-dependent manner and caused a 3-told 
increase in PGE 2 production at 10 ng/ml in pre-estrous/ 
estrous cervical tissues (n = 8. Fig. 9). FSH had no effect 
on the small amount of basal PGF 2q (< 1 ng/100 mg) pro- 
duced by cervical tissue (data not shown). FSH did not 
elevate PGE 2 in cervical tissues from the luteal phase; fur- 
thermore, a small but significant (p < 0.05) inhibition was 
observed at postovulation (Fig. 9). 
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FIG. 8. Effect of FSH on adenylate cyclase production in cervical tissue. 
A) Cervical tissues from six cows at pre-estrus/estrus were incubated (4 
replicates) for 10 min in the absence or presence of FSH (0, 5, 10, 20, or 
40 ng/ml) or forskolin (10 p.mol/L), and the amount of cAMP produced 
was measured by a protein binding assay. B) Cervical tissues from six 
cows at each of the three stages of the estrous cycle were incubated (4 
replicates) for 10 min in the absence or presence of FSH {10 ng/ml) or 
forskolin (10 jxM). The amount of cAMP produced was measured by a 
protein binding assay. Columns are means ± SEM. Columns with asterisks 
were statistically different from their own control (*p < 0.01; **p < 0.05). 
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FIG. 9. FSH dose response on production of PGE 2 by cervical tissues. 
Cervical tissues from 8 pre-estrous/estrous, postovulatory, or luteal-phase 
cows were incubated in the absence or presence of FSH (5, 10, or 20 ng/ 
ml) for 6 h, and the effect on the production of PGE, was measured by 
RIA. *p < 0.01; **p < 0.05. 
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FIG. 10- Effect of incubation of cervical tissues with FSH on the ex- 
pression of cyclooxygenase. Cervical tissues from 5 pre-estrous/estrous 
cows were incubated in the absence or presence of FSH (10 ng/ml). The 
72-kDa cyclooxygenase II protein was elevated at 3 h of incubation. Lane 
A bovine granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, 
cervical tissue after 3 h of incubation in TCM-199; lane D, cervical ttssue 
after 3 h of incubation with 10 ng/ml of FSH. Columns are means ± SEM 
of 72-kDa protein. 'Significantly different (p < 0.01) from the column 
representing the control at 3 h (with an arbitrary value of 1 .0). 



Cyclooxygenase Activation by FSH 

To determine whether the presence of FSHR was asso- 
ciated with the expression of cyclooxygenase, cervical 
minces were incubated for 3 h in the presence or absence 
of FSH (10 ng/ml). Tissues from pre-estrous/estrous, post- 
ovulatory, and luteal-phase cows were extracted, separated 
on SDS-PAGE, and tested for the cyclooxygenase II (72 
kDa) using a specific antibody. It was found that in cervical 
tissues from six pre-estrous/estrous cows, FSH induced a 
200% increase in the expression of cyclooxygenase after 3 
h of incubation (Fig. 10). In contrast, no effect on cyclo- 
oxygenase expression was seen in cervical tissues from lu- 
teal-phase cows, and an insignificant (p > 0.05) elevation 
at the postovulatory phase was observed (data not shown). 

DISCUSSION 

Results of this study demonstrate by both PCR amplifi- 
1^ cation and Northern blot analysis that FSHR mRNA is pres- 
* ; ent in the bovine cervix. The bands corresponding to the 
receptor (75 kDa) and its mRNA were maximally expressed 
during the pre-estrous/estrous phase. Incubation of cervical 
tissues with FSH increased the expression of cyclooxygen- 



ase and the production of PGE 2 (and possibly ?GE X ) at the 
pre-estrous/estrous phase. The stimulation of cervical cy- 
clooxygenase was associated with activation of both the 
PLC and adenylate cyclase second messenger G-protein re- 
lated pathways. 

Northern blot analysis of bovine cervical tissues from 
pre-estrous/estrous cows revealed multiple transcripts for 
bovine cervical FSHR. The sizes of the major transcripts 
were 2.5 kb, 3.3 kb, and 3.8 kb, which were similar to those 
obtained in bovine granulosa cells [7], rat testis and ovary 
[13, 14], and human ovary [15] and myometrium [16]. Oth- 
er minor transcripts have been reported in all of these spe- 
cies. The RT-PCR product yielded a single band of 384 bp, 
which was the expected molecular size, so the different 
isoforms of the FSHR are probably the result of differential 
splicing of the same transcript. Isoforms of FSHR have 
been characterized in human [17] and primate [18] ovary 
and in ovine testis [19]. Since only a single 2.5-kb isomer 
was present in luteal-phase cervix (which did weakly pro- 
duce FSHR), it would appear that there are different iso- 
forms present in the bovine cervix as well. 

The nucleotide sequence homology between our RT- 
PCR 384-bp product was 97.3% identical to the comparable 
region of the bovine FSHR, showing that our amplified 
cDNA was complementary to the Bos taurus mRNA FSHR. 
The PCR product of 384-bp cDNA was found primarily in 
the pre-estrous/estrous cervix. 

Using specific antibody raised in rabbits against hFSHR 
(amino acid sequence 265-296) for Western blot analysis 
resulted in a major signal for a 75-kDa protein. A protein 
of similar molecular mass has been reported m rat and hu- 
man ovary [20]. A strong signal for this protein was seen 
in the pre-estrous/estrous cervix compared with cervix from 
the other stages of the cycle. This cervical receptor was 
regulated by FSH itself, similar to the regulation of the 
FSHR in the rat ovary [14]. 

FSH increased cAMP production by cervical tissue from 
the pre-estrous/estrous phase but did not elevate cAMP in 
the postovulatory and luteal phases. FSH elevation of IPs 
displayed the same pattern. It would therefore appear that 
the cervical FSHR is associated with signal transduction 
pathways in a way similar to the accepted mechanism of 
LH action in the ovary and testes, i.e., activation of .the 
second messenger pathways — adenylate cyclase [21] and 
phosphatidyl-inositol (PLC) [22, 23]. The inhibitory effect 
of FSH on IPs and cAMP at postovulation occur when 
FSHR is low. Since FSH activity is biphasic, i.e., is stim- 
ulated by a low dose and inhibited by a high dose, an in- 
hibitory effect at low concentrations could be the result of 
the change in the ligand-receptor ratio. 

FSH induced about a 200% increase in the expression 
of cyclooxygenase at pre-estrus/estrus but had no effect on 
the two other stages of the cycle. This was similar to the 
response of cyclooxygenase to LH that was observed for 
the bovine endometrium [2] and uterine vein [6]. However, 
in the endometrium, LH increased cyclooxygenase during 
both the luteal and postovulatory phases but not at estrus 
[2]. Furthermore, the induction of cyclooxygenase by go- 
nadotropin in the endometrium was associated with an in- 
creased level of PGF 2ft , while in the cervix PGE 2 was the 
major product. 

The bFSH was of immunological grade and devoid ol 
residual biological contamination of LH, thyroid-stimulat- 
ing hormone, growth hormone, prolactin, or ACTH. The 
effects on increasing cAMP, PGE 2? and cyclooxygenase as 
well as induction of FSHR were therefore specific. Fur- 
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thermore, it was found that highly purified hFSH (10-20 
ng/ml) (donation of J.A. Dias) had the same effects as 
bFSH when incubated with cervical tissue under the same 
conditions (2.5-fold increase in cAMP; 3-fold increase in 
PGE 2 ; 2-fold increase in cyclooxygenase). 

The relaxation and opening of the gravid cervix is due 
to active biochemical and structural changes in the cervical 
connective tissue that are mediated in part by prostaglan- 
dins [24]. It is thought that stretching of the cervix is a 
factor in causing the release of these prostaglandins [25, 
26]. The effect of PGE in causing cervical softening in the 
ewe is also well documented [27-29]. In both nonpregnant 
and pregnant cows, PGE 2 causes an increase in cervical 
opening within 3 h of treatment [30]. PGE! administered 
intra-cervically in a jelly was shown to decrease cervical 
resistance within 24 h [31]. 

Recently, we reported [6] that oxytocin caused a signif- 
icant stimulation of PGE 2 production in vitro in cervical 
tissues from pre-estrous/estrous cows but had no effect on 
PGE 2 production in cervical tissue from other stages of the 
estrous cycle of the cow. Similarly, in an initial report [31], 
it was shown that administration of oxytocin to pre-estrous 
cows increased the concentration of PGE 2 in the cervical 
exudate. However, peripheral oxytocin concentrations dur- 
ing estrus are lower than during the luteal phase [32, 33], 
and elevations of both peripheral oxytocin and cervical 
oxytocin receptor concentration are necessary for oxytocin 
to cause cervical softening towards parturition [24]. Fur- 
thermore, progesterone in vitro induced a dose- dependent 
inhibition of PGE 2 release by cervical tissues from pre- 
estrous/estrous cows, and this was associated with a de- 
crease in both basal and oxytocin-stimulated PGE 2 produc- 
tion [6]. It would therefore seem that hormones other than 
oxytocin are responsible for the increase in cervical PGE 2 
in the pre-estrous/estrous cow. The present report indicates 
that FSH, which has its peak peripheral concentration at the 
time of estrus, could be the hormone that increases cervical 
PGE 2 , as the FSHR expression is maximal at this time and 
FSH in vitro increases PGE 2 production by the cervix. 

The bovine cervix at pre-estrus/estrus has high levels of 
FSHR protein and its corresponding mRNA. Activation of 
the receptor by FSH is associated with the G-protein-cou- 
pled receptor family that mediates the cAMP and IP sig- 
nalling pathways. These signalling pathways then increase 
the expression of cyclooxygenase and production of PGE 2 . 
The expression of the FSHR was maximal at the time of 
the FSH peak in the blood and suggests a physiological 
role for FSH in the relaxation and opening of the cervix at 
estrus. 
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The alpha(v)beta6 integrin receptor for Foot-and-mouth disease virus is 
expressed constitutively on the epithelial cells targeted iu cattle. 

Mona K han P , GoldS, Simpson J, Zhang Z, Weinrcb PH . Violcttc SM . Alexandersen 
S, Jackson T . ~ 

Institute for Animal Health, Pirbright Laboratory, Ash Road, Pirbright, Surrey GU24 
ONF, UK. 

Field strains of Foot-and-mouth disease virus (FMDV) use a number of alpha(v)- 
integrins as receptors to initiate infection on cultured cells, and integriris are believed to 
be the receptors used to target epithelial cells in animals. In this study, 
immunofluorescence confocal microscopy and real-time RT-PCR were used to 
investigate expression of two of the integrin receptors of FMDV, alpha(v)beta6 and 
alpha(v)beta3, within various epithelia targeted by this virus in cattle. These studies show 
that alpha(v)beta6 is expressed constitutively on the surfaces of epitheliaL cells at sites 
where infectious lesions occur during a natural infection, but not at sites where lesions 
are not normally formed. Expression of alpha(v)beta6 protein at these sites showed a 
good correlation with the relative abundance of beta6 mRNA. In contrast, alpha(v)beta3 
protein was only detected at low levels on the vasculature and not on the epithelial cells 
of any of the tissues investigated. Together, these data suggest that in cattle, 
alpha(v)beta6, rather than alpha(v)beta3, serves as the major receptor that determines the 
tropism of FMDV for the epithelia normally targeted by this virus. 

PMED: 161 8623 1 [PubMed - in process] 



10 9: Int J Cancer. 200 3 Jim 20;l05(3):353-60. 
IflterScicnce* 



Related Articles, Uriks 



Qrolcinase-mediated posttranscriptional regulation of urolcinase-receptor 
expression in aon small cell lung carcinoma. 

Montuori N, Mactiello A, Mancini A, Taglialatela l\ Caputi M , Rossi G, Ragno P . 

Istituto di Endocrinologia ed Oncologia Sperimentale, Consiglio Nazionale delle 
Ricerche, Naples, Italy. 

The urokinase-type plasminogen activator (uPA) and its cellular receptor (uPAR) are 
involved in the proteolytic cascade required for tumor cell dissemination and metastasis, 
and are highly expressed in many human tumors. We have recently reported that uP A, 
independently of its enzymatic activity/is able to increase the expression of its own 
receptor in uPAR-rransfected kidney cells at a pes transcriptional level. In fact, uPA, 
upon binding uPAR, modulates the activity and/or the level of a mRNA-stabilizing factor 
that binds the coding region of uPAR-mRNA. We now investigate the relevance of uPA- 
mediated posttranscriptional regulation of uPAR expression in npn small cell lung 
carcinoma (NSCLC), in which the up-regulation of uPAR expression is a prognostic 
marker. We show that uPA is able to increase uPAR expression, both at protein and 
mRNA levels, in primary cell cultures obtained from tumor and adjacent normal lung 
tissues o f patients affected by NSCLC, thus suggesting that the enzyme can exert its 
effect in lung cells. We investigated the relationship among the levels of uPA, uPAR and 
uPAR-mRNA binding protein(s) in NSCLC. Lung tissue analysis of 35 NSCLC patients 
shows an increase of both uPA and uPAR in tumor tissues, as compared to adjacent 
normal tissues, in 27 patients (77%); 19 of these 27 patients also show a parallel increase 
of the level and/or binding activity t>f a cellular protein capable of binding the coding 
region of uPAR-mRNA. Therefore, in tumor tissues, a strong correlation is observed 
among these 3 parameters, uPA, uPAR and the level and/or the activity of a uPAR- 
mRNA binding protein. We then suggest that uPA regulates uPAR expression in NSCLC 
at a posttranscriptional level by increasing uPAR-stability through a cellular factor that 
binds the coding region of uPAR-mRNA. Copyright 2003 Wiley-Liss, Inc. 

PMID: 12704669 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor expression correlates with matrix 
metalloproteinases MT1-MMP, MMP-2 and MMP-9 in human 
glioblastomas. 

MunaujC, Noel A, Hougrand O , Foidart JM Bonivgrj n^^rM' 

Laboratory of Tumour and Development Biology, University of Liege, Liege, Belgium. 

Vascular endothelial growth factor (VEGF) is the major endothelial mitogen in central 
nervous system neoplasms and it is expressed in 64-95% of glioblastomas (GBMs) 
Tumour cells are the main source of VEGF in GBMs whereas VEGF receptors (VEGFR- 
1, its soluble form sVEGFR-l, VEGFR-2 and neuropilin-1) are expressed predominantly 
by endothelial cells. Infiltrating tumour cells and newly-formed capillaries progress 
J^f^ extracellu,ar matrix b y-local proteolysis involving matrix metalloproteinases 
(MMPs). Recent studies have shown that VEGF expression and bioavailability can be. 
modulated by MMPs. We reported previously that the expression of MT1-MMP in 
human breast cancer cells was associated with an enhanced VEGF expression We used 
quanhtauve RT-PGR, Western blot, gelatin zymography and immunohistochemistry to 
££E T^T? Si0 " ° f VEGF ' VEGFR - 1 . VEGFR-2, sVEGFR-1, neuropiIin-l r MTl- 
MMP, MMP-2, MMP-9 and TIM.P-Zin 20 human GBMs and 5 normal brains The 
expression of these MMPs was markedly increased in most GBMs with excellent 
correlation between mRNA and protein levels; activated forms of MMP-2 and MMP-9 
were present in 8/18 and 7/1 8 of GBMs. A majority of GBMs (17/20) also expressed high 
IS 3S prev,ousIv re POrted, with strong correlation between VEGF and MT1 - 

MMP gene expression levels, and double immunostaining showed that VEGF and MTI- 
MMP peptides co-localize in tumour and endothelial cells. Our results suggest that the 
interplay between metalloproteinases and VEGF previously described in experimental 
tumours may also be operative in human GBMs, Because of its dual ability to activate 
MMP|.2 and to up-regulate VEGF, MT1-MMP might be of central importance in the 

, . MS 3nd re P resent 311 interesting target for anti-cancer treatments. Copyright 
2003 Wiley-Liss, Inc. " 
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MT1-MMP, MMP-2, MMP-9 aiid TTMP-2 in a series of 20 GBMs 
and 5 normal brains. Using quantitative RT-PCR,. gelatin zymog- 
rapby, Western blot and immunohistochernistry, we showed a 
strong correlation between the expression of VEGF, MT1-MMP, 
MMP-2 and MMP-9 in GBMs. These results are in accordance 
with previous in vino studies and add to the evidence of an 
interplay between VEGF and MMPs in the progression of human 
GBMs. 



MATERIAL AND METHODS 

Patients 

We studied 20 GBMs diagnosed at the Laboratory of Neuropa- 
thology-CHU Liege between 1997 and 2001. The series included 
17 primary GBMs (lc., no previous history, of lower grade diffuse 
astrocytoma) and 3 secondary GBMs (/.«., previous history of 
lower grade diffuse astrocytoma). Clinical information on these 20 
cases have been reported previously as part of a larger series. 31 The 
gender ratio was 1/1, and the age at time of diagnosis ranged from 
41-79 years (mean 56 years). Normal brain cortex and white 
matter were obtained from 5 patients with intractable epilepsy 
treated by partial temporal lobectomy. Histological examination of 
these specimens showed severe hippocampal sclerosis; frozen tis- 
sue was sampled from microscopically normal inferior temporal 
gyri. Our study was approved by the Ethical. Committee of the 
Faculty of Medicine of the University of Liege. 



TINA extraction and cDNA synthesis 

Total RNA was extracted from cryosections with RNeasy Mini 
Kit (QIAGEN GmbH, Hilden, Germany) according to the manu- 
facturer's protocol Total RNA (1 |ig) was reverse transcribed with ; 
a ThermoScript reverse transcriptase (TherrhoScript RT-PCR Sys- 
tem, Invitrogen, Carlsbad, CA) and random hexamers as primers. 

Primers 

: Primers pairs used in our study are describe d in Table I. Primers 
for the VEGF gene were chosen to distinguish between VEGF )K? , 
VEGF )W> VEGF I43 and VEGF 12I mRNA isoforrns. Intron-span- 
ning primers • and probes for the TaqMan system (primers for • 
VEGFR-1 (Flt-lXsVEGrTl-|,VEGra-2(ia>R^ : l)andNRPl) 
were designed to meet specific criteria by using Primer Express 
software (Perkin Elmer, Foster City, CA\ All primers were syrn 
thesized by Eurogentec (liege, Belgium). The 5'- and 3'-end 
nucleotides of the probe were labeled with a Teporter (FAM = 
6-carboxy-fh)6rescein) and a quencher dye (TAMRA = 6-car- 
boxy-tetramethylrbodaroine). We conducted BLASTn (National 
Center for Biotechnology Information, Bether>da) searches against 
dbEST and the non redundant set of GenBank, EMBL, and DDBJ 
database sequences to confirm the total gene specificity of the 
nucleotide sequences chosen for the primers. The specificity of the 
amplified PCR products-was confirmed either by restriction digest 
or by sequencing. The 18S ribosomal RNA was measured using 
the Pre-Developed TaqMan Assay Reagents Endogenous control 
kit from Applied Biosystems (Foster City, CA). .* 



TABLE 1- SEQUENCE OF PRIMERS AND TaqMan PROBES USED FOR RT-PCR STUDIES 



Gene and accession 
number 


Pbsition. 


MMP-2 FP 


I740F 


MMP-2-RP . 


1964R 


NM„004530 . 




MMP-9-FP 


1592F 


MMP-9-RP 


1800R 


J05070 




MMP-14-FP * 


1288F 


MMP-14-RP 


1508R 


NM_004995 




TIMP1-FP 


: 78F 


T1MPI-RP 


245R 


M 12670 




T1MP-2-FP 


78F 


T1MP-2-RP 


245R 


NM 003255 




VEGF-FP 


1208F 


VEGF-RP 


1687R 


AH001553 




28S rRNA-RP 


12403F 


28S rRNA-RP 


12614R 


UI3369 




VEGFR1-FP 


2438F 


VEGFR1-RP 


25J6R 


VEGFR1 Probe 


2469 


AF063657 




sVECFRJ-FP 


2209F 


sVEGFRJ-RP 


2388R 


sVEGFRJ Probe 


2257 


U0I134 




VEGFR2-FP 


79IF 


VEGFR2-RP 


946R 


VEGFR2 Probe 


820 


AF063658 




NRPI-FP 


I83>F 


NRPI-RP 


1942R 


NRP1 Probe 


1883 


XM 034725 





Sequence 



5 '-AG ATCTTCTTCTTC A A GG A CCGGTT-3 ' 
5VGGCTGGTCAGTGGCTTGGGGTA-3' 

5'-GCGGAGATTGGGAACCAGCTGTA-3' 
S'-GACGCGCCTGTGTACACCCACAO' 

5'-GGATACCCAATGCCCATTGGCCA-3' 
5'-CCATTGGGCATCCAGAAGAGACC-3' 

5'CATCCTGjTGTTGCTGTGGCTGAT-3' 
5'-GTCATCTTGATCTCATA ACGCTGG-3 ' 

5'-CTCGCTGGACGTTGGAGGAAAGAA-3' 
5 r -AGCCCATCTGGTACCTGTGGTTCA-3' 

5'-CCTGGTGGACATCTTCCAGGAGTA-3' 
5'-CTCACCGCCTCGGCTrGTCACA-3' 



5'-GJTCACCCACTAATAGGGAACGTGA-3' 
S'-GATTCTGACTTAG A GGCGTTCAGT- 3' 

5'-TCCCITATGATGCCAGCAACT-3' 

5'-CCAAAACCCCCTCTTCCAA-3' 

5'-CCGGGAGAGACTTAAACTGGGCAAATCA-3' 

5'-ACAATCAGAGGTGAGCACTGCAA-3' 
5'-TCCCAGCCTGAAAGTTAGCAA-3' 

5'-TCCAAATTTAAAAGCACAAGGAATGATTGTACCAC-3' 

5'-CTTCGAAGCATCAGCATAAGAAACT-3' 

5'-TGGTCATCAGCCCACTGGAT-3' 

5'-AACCGAGACCTAAAAACCCAGTCTGGCAGT-3' 

5'-CACAGTGGAACAGGTGaTGACTTC-3' 
5'-AACCATATGTTGGAAACTCTGA TTGT-3'. 
5'-CCACAGAAAAGCCCACGCTCATAGACA-3' 



225 bp 
203 bp 
221 bp 
163 bp 
15:5 bp 



479 bp 
407 bp 
347 bp 
275 bp 

212 bp 



79 bp 



180 bp 



156 bp 



112 bp 



Cycles 

33 

32 . 
■ 33 
30 
33 

19 
40 

40 

40 

40 



1 

J 
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6000 



5500 



5000 



4500 



4000 



^ 3500 



* 3000 
g 2500 



2000 



1500- 



1000 



VEGF VEGFR1 sVEGFRI VEGFR2 HHP 

JoTZ Jur, w F "S "ceptors mRNA quanfification: scat- 
spots) roRNA levels are expressed as normalized values (as described 
senrs ine mean of 3 separate experiments. 

T1MP-2 mRNA and. VEGF mRNA isofomu 

for™™™- M *JP-2. MMP-9, nMP-2 a„d VEGF mRNA iso- 
fonro (VEGF 1W , VEGF 1M , VEGF )43 and VEGF,,.) were mea- 
sured ,„ l0 ng abbots of cDNA ^ Taq polyrnerast £££ 
Shiga, Japan and 5 pmol of each primers (Tabte ]). The tbenna 

an^T" ,T' Uded ^ 95 ° C fw *•*■»*■. and 

OTfflt f °"v!rf "i 94 °- 20 6600 and 20 sec at 

72»r t£p ^. ,Sofoims) wilh a final iwbwiw 2 min a. 
Ju ,\v%J!°? 0C V' at leso,Ved 0D 2 * Nos » e ^ 3:1 agarose 
gels (BroWhrttake,. Rockland, MD) and analyzed nsing a FWS 

^SX^iA 1 ^ CA) af ' CT e * idium 
sta.mng. Spec.fic mRNA levels were expressed as Ibe ratio of 

sp«,6c t^scnpts^SS transcript, Experiments Were reputed a! 

least J limes in duplicate. 

TJ'NRPjZmA' iV ' PCRf0r VEGFR ' 1 ' sV£GFR1 - v EGFR-2 
Real-lime quantitative RT-PCR analyses for VFnPB i 
s VEGFR- 1 VEGFR-2, NRP1 mRNAs and m WA S 
ned out using ibe AB1 PRISM 7700 Sequence Detecuon System 

«Vh P k pa " S J md ""O'OSWC Probes that were used for 
5 fotf •*? M T3bfe 1 A SWndafd generated by 

50.000-._80 ng and was run ,„ duplicate during every experiment 
Fo, each expenmental sample, ,he amount of u,ro el ^nT^ 

of the target gene was normabzed against 18S tRNA to compen- 



w va ^ a,,0 ? 1 .'" 9» a 'i'y of RNA and the amount of input 
cDNA (as described bythe manufacturer PE Applied Biosystems 

SSl^^?*' M, , x < A « ,,ied Biosyste m s)using5 u.1 of diluted 
cDNA (equvalent to 1 0 ng total RNAX 200 nM of the probe, and 
400 nM pnmeis in a 25 pj final reaction mixture. After a 2 min 
incubation at 50»C to allow for UNG cleavage, AmpliTaq Gold 
activated by an incubation for lOmiii at 95°C Each of the 40 
PCR cycles consisted, of 15 sec of denaturation at 95°C and 
hybridization of probe and primers for 1 min at 60°C. 

To confirm amplification specificity, the PCR products were 
also examined by subsequent 2% agarose gel electrophoresis. 
Experiments were repeated at least 3 limes in duplicate. 

lmmunohutochemisiry for VEGF and MTJ-MMP 

Sections (4 urn thick) were cut from formalin-fixed, paraffin 
embedfcd tuipour ^ tissue. They were hydrated through graded 
alcohols and mcubated in H 2 0 2 (03% 15 min). Sections were 
au^lavedfor 11 mi„ at 126»C in citrate buffer pH6 for antigen 
retrieval (Dako. Glostrup, Denmark). For double LmunostaiW 

STa^ ]°^lrt m P ' an ? y monoclonal A >» 
r^Tr Al>4 >' 100 .( ( ^">g™e Research Products, San Diego. 
CA) followed by peroxidase-conjugated EnVision (Dako) Imimi- 
noreactmty was visualized with 33'diaroinobenzidine ADAB+ 
™' fl^r?/c Were „ lhen »™bated with polyclonal Ab aW 
VEGF 1:150 (Santa Cruz. Santa Cruz, CA) for 1 ht at room 

« by a ? ka,iDe 

y ^"""^"Vty for VEGF was visualized with . 
Fast Red chromogenic substrate (Dako). Single immunostairring 
was also carried out on serial sections using each primary antibod? 

N^tiv, r , " eSp0, ! di, ? g ^yme-chromogene combination 
Negative controls were obtained by omitting the primary anlibod- 

Gelatin zymography assay 

MMP-2 and MMP-9 activities were quantified by gelatin zy- 
mography on 2 normal brains and 18 GBMs. Ten cryosections (JO 
M-M) were W>genized in buffer (0.1 M Tris-HQ pH 8.1, 0.4% 
Jnton X-100) and centrifuged for 20 min at 5,000*. The pellets 
were discarded. 25 p.g of total protein from homogenate supema- 

? n D f Vn C u 1X *« ^ "° n rcducin S sample buffer (62.5 mJVi 
Tns-HCl pH 6.8; 2% SDS; 10% gJyceroJ; 0.1% bromopherml 

wc^o^JS* 0 '"^ dircc,] y on l0% SDS-polyacryiamide 
gels (SDS-PAGE) containing 0J% gelatin (w/y)" Afier electro- 
phoresis, gels were washed for ) hr at room temperature in a 2% 
«a } T ™™. X ' m solu,io " »<> remove SDS„ transfened to a buffer 
^0 mM Tris-HQ pH 7,6. containing 10 mM CaQ,) and incu- 
bated for 18 hi at 37<>C. Gels were stained for 30 mm with 0.1% 
(w/v) Coom asS .e brilliant blue G250in 45% <v/v) methanol/10% 
- (v/v) acetic acid and destained in 10% (v/v) acetic acid/20% (v/v) 
™ C ?! WeiC ana,v ^ e d w i»h Quantity One software (ver- 
sion 4.2.2, Bio-Rad Laboratories, Hercules, CA) after densitomet- 
ry scanning of the o e ] s using a Fluor-S Multimager (BioKad). 

Western blot 

MT1-MMP protein levels were analyzed in 2 normal brains and 

k r it B !?Z C ' MraClS (25 weie ™wcd with 1/2 sample 
buffer {0.25 M Tns < P H6,8), 10% SDS (w/v), 4% sucrose (v/v), 
a I J7 loelha w>» < vA ') and 0. 1 25% bromophenol blue (w/v)) 
and boded for 5 mm. They were separated on 10% SDS-PAGE 
gels and transferred lo a PVDF filter (NEN, Boston. MA). After 
blocking w,th 5% milk (w/v), 0.1% tween 20 (w/v) in PBS for 2 
hr at room lempeiature. membranes were exposed to the primary 
armbody (10 ixg/ml clone 113-5B7, Ab-4, Oncogene Research 
Products, San Diego, CA) at 4°C overnight followed by incubation 
with a horseradish peroxidase-conjugated rabbit anli-mouse anti- 
body (1.3 u.g/ml, Dako, Glostrup, Denmark). Signals were de- 
tected with an enhanced chemoluminescence (ECL) kit (NEN. 
Boston, MA). The relative intensities of the immunoreactive bands 
were analyzed w.th Quantity One software (version 4 2 2 Bio-Rad . 
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221 bp 
225 bp 

155 bp 



_ m um> - ^ » -» " 20B bp 



Figure 2-MMPs am) T1MP-2 mRNA quantifica- 
tion, (a) Representative 2% agarose gels of RT-PCR 
products for MTI-MMP, MMP-2, T7MP-2 and 
MMP-9 in 2 normal brains (NB) and 10 GBMs. (fr) 
Scatter plots (as described in Fig. 1). Experiment was 
repeated at least 3 times in duplicate. 



Laboratories) after densitometric scanning of the X-ray films using 
a Fluor-S Multimager (Bio-Rad). 

Statistics 

VEGF, . VEGFRs, MMPs and HMP-2 expression values in 
GBMs were correlated using Spearman's test. Correlation was 
considered significant for 2-tailed p-value < 0.05. Statistical anal- 
ysis was carried out using the Prism 3.0 software (GraphPad, San 
Diego, CA). 

RESULTS 

Expression of VEGF and VEGF receptors 

VEGF mRNA was present in normal brains (295-375, arbitrary 
units; mean = 322) and in all GBM samples (2 17-5, 1 12; mean = 
1/774) as reported previously (Fig. I). 3 ' In most GBMs, VEGF 
mRNA levels were raised 2- 1 5- fold above normal brain values. 
Tbe most abundant isoform in all cases was VEGF I65 , followed by 
VEGF 1?l , VEGF 189 and VEGF M5 (data not shown). VEOFR-J 
expression was found at similar levels in GBMs (89-357; mean - 
182) and normal controls (154-198; mean = 181). There was no 
correlation between VECFR-J and VEGF . mRNA levels (p = 
035) in GBMs. VEGFR-2 was expressed in all GBMs (48-582; 
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^ .300 - 
P. 
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a> 

o 200 
< 

j| 150 
E 

100 
50 
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•2 s 



MT1 



MMP-2 TIMP-2 MMP-9 



mean = 210) and in 8/20 cases at least twice npnnal values 
(87-111; mean = 103). VEGF and VEGFR-2 expressions were 
correlated significantly (p = 0.0035) in GBMs: NRP1 expression 
varied broadly between GBMs (75-3,260; mean = 1.061) con- 
trasting with a constant baseline expression in notrnal controte 
(383-397; mean - 390). In rumours, NRP1 correlated w,tn 
VEGFR-2 (p = 0.0119) but not with VEGF (p = 0084), nor 
VEGFR-1 (p = 0.066). sVEGFR-i. was expressed at low levels 
both in normal brains (84-92; mean - 87) and GBMs (25-^f' 
mean = 101). sVEGFR-1, however, was found to correlate wjin 
VEGFR-) (p = 0.0289), VEGFR-2 (p = 0.0029), and HRP1 (P - 
0.0027) but not with VEGF ip - 0.053). 

Expression of MMPs and TJMP-2 

MTI-MMP, MMP-2 and MMP-9 were expressed in both nor- 
mal brains and GBMs but at much higher levels in the latter (Hg- 
2a). MTI-MMP mRNA levels were constantly higher id OiJfvjs 
(34-202; mean = 106) than in normal controls (3^29; mean 
15) MMP-2 and MMP-9 mRNA levels were higheT than controls 
in 1 8/20 and 1 4/20 cases respectively (Fig. 2b) and con elated witf) 
each other (p = 0.01 87). MTI -MMP.mRNA levels correlated I with 
MMP-2 (p - 0.0008) and MMP-9 (p = 0.005). ™ ,p -2 I 
non-discriminative distribution in relation to the controls. T3MF-/ 
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Rgure 3 - Zymographic analysis of MMP-2 and MMP-9 in tissue 
S^-?* 1 ? from norma) W or GBMs <GB)Xdimn lo£ 
Aboncd by human HT1080 cells wis included as S con^roL 

Sby MMP_9> Prt> - MMP - 2 ^Maffll 



1 Vetera Wot analysis using the ab4 antibody (clone 
1 13-5B7) raised against the catalytic domain of m\-MMP 
extracts from MTl-MMP transfected A2^ cel^o^'^e^ 



AND TIMP-2 EXPRESSED AS p V^UK dSjvTD^ ' 
;_. FROM SPEARMAN* S TEST 



MTl-MMP 



MMP-2 



MMP-9.- 



VEGF 

VEGFR-1 

VEGFR-2 

NRP1 

sVEGFR-1 



TIMP-2 



0.Q250 
0.0073 
<0.000I 
0.0053 
0.0313 



proMMP-? 


Nl 


0.41 


N2. 


0.28 


GBJ 


9.00 


GB2 


4.28 


GB3 


1.67 


GB4 


1.40 


GB5 


0.47 


GB7 


5.29 


GB8 


5.75 


GB9 


5:02 


GBI2 


5.08 


GB13, 


6.86 


GB14 


5.59 


GBI5 


8.00 


GBJ6 


5.55 


GB17 


6.61 


GB18 


14.79 


GB20 


18.85 


GB21 


16.52 


GB22 


20.66 • 



Grfaiip zymo^by .WetftmMoi 
MMP» proMMP-2 MMP-2 MTl-MMP 



0.0245 

0.0710 
0.0168 
0.1334 
0.0469 



. 0.0053 
<0.0001 
0.0004 
0.2457 
0.0194 



0.0094 
0.4542 
0.0153 
0.5480 
0.2563 



0.00 

0.00 

1.25 

0.00 

0.00 

0.00 

0.00 

0.00 

3.78 

0.00 

0.00 

4.58 

6.75 

0.00 

1.10 

2.04 

4.34 

0.00 

0.00 

0.00 



1.52 

0.83 

2.45 

2,88 

2.86 

4.33 

1.15 

1-12 

4.63 

4.09 

4.79 

4.93 

5.53 

5.13 

5.66 

6.00 

8.20 

2.12 

2.37 

5.11 



0.00 

0.00 
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0.09 

0.00 

0.03 

0.00 

0.00 

0.65 

0.00 

0.20 

0.48 

0.76 

0.00 

0.00 

0.00 

0.40 

0.00 

0.00 
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O^T^' e - Nfr, MMP (P = 0.0019) and MMP-2 (p = 
0.0002) but not with MMP-9 (p = 0.1408) 

*£L fn'ftelS ?R 8 M to.^MMF-i a»d proMMP-9 were de- 
tected >n Ite 18 GBMs and 2 comrols examined (fie 3 Table m 

t™S£* MS - IT* ° f,heSe iD3C,ive '«* 4« Than "„ 
norma) brams and were correlated wi.h tbeir respective mRN A 

MMK-z and MMP-9 were oot found in normal brain By contrast 

GRM J-T l Ve,S WCTC * Western blot in 15 

GBMs and 2 normal brams (Fig. 4. Table 11). They were signifi 



^l^^^^^T^^^ "*vated MMP-2 levels 
(p - 0.0226) but not with activated MMP-9 levels iv = 006i 

1 .T-^TV^ Pr ° tein ^ levels were 

^ed s,gnjficanUy (p = 0.089), aigui D g for a predominantly Iran- - 

scnptional regulation in GBMs: 

Correlation between VEGF network and MMPs 

M^mp mLo V^JS* VEGF compared to 

MTl-MMP, MMP-2, MMP-9 and TTMP-2 (Table HI). Tnere was 

L^x^o 0irc J a ^ n bc!ween VjEGF expression and MT1- 
MMP^ MMP-2 arid MMP-9. A similar correlation was also ob- 
served between VEGFR-2 and MMPs. Interestingly HMP-2 ex- 

^vr^rf COne,aled VEGF and VEGFR-2 but not with 
other VEGF receptors. 

lmmunohistochemistryfor VEGF and MTl-MMP 

VEGF immunoreactivity was shown in both tumour and endo- 
thehal cells, ^viously reported (Fig. 5aJ>)?\ By single irhmu- 
nostarmng. MTl-MMP was detected in glioblastoma cells as a 
diffuse cytoplasmic staining (Fig. 5 Ct d) - MTl-MMP positiviry was 
also seen m endothelial cells and perivascular cells (Rg 5c). By 

!J ^?f^^ St . a,niDe, WC 0bserved the co^locafoation of VEGF 
and^MTl-MMP in the cytoplasm of numerous tumour cells (Fig. 

BJSCUSSJON 

GBMs are highly malignant tumours with poor prognosis. They 
show major microvascular proliferation and express high levels of 
vtor*. VfcfJF 1S a strong mitogen for endothelial cells mereby 
promoting ang,ogenesis. Previous reports"^ have sug ge StM l that 
VEGF also stimulates tumour cell invasion, migration arid survival 
in malignant epithelial cells through an autocrine loop by which 
overexpression of MMPs induces VEGF secretion and leads to 
subsequent amplificauon of cell proliferation and protection 
against apoptosis We and others reported previously that in human 
melanoma and breast carcinoma cells, MTl-MMP upregulates 
VEGF expression whereas TIMP-2 reduces it.^ ^ Therefore the 
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Figure 5 - Irarounohistocbemis- 
iry (scale bar - 50 pro). (ogb)VEGF. 
positive tumour and endothelial cells 
(plain arrows) show granular red 
staining of the cytoplasm. Negative 
celts (empty arrow) are seen in their 
close vicinity, (c) MT1-MMP posi- 
tive cells show a strong brown cyto- 
plasmic staining. They include tu- 
mours cells (plain arrow) and • 
endothelial cells (empty arrow), (d) 
MT1-MMP positive tumour cells 
(plain arrow) are mixed with nega- 
tive cells (empty arrow): {ej) Double 
staining with VEGF (red) and MTI- 
MMP1 (brown). Double positive tu- 
mour cells (plain arrows) contrast 
with negative or single weakly pos- 
itive cell (empty arrow). 



pericellular proteolysis mediated by MTJ-MMP in GBMs could 
also induce an autocrine loop resulting in enhanced VEGF expres- 
sion. In rum, VEGF could act as a paracrine factor on endothelial 
cells to stimulate angiogenesis. or possibly as an autocrine factor 
promoting glioblastoma cells survival-migration and invasion as 
demonstrated recently in the various tumour eel! culture models. 

We compared the expression of VEGF and ils receptors with 
MTI-MMP, MMP-2 and MMP-9 in 20 GBMS and 5 normal brains. 
The expression of these MMPs was markedly increased in most 
GBMs with excellent correlation between mRNA and protein levels. 
MT1-MMP expression has been shown previously to correlate with 
glioma aggressiveness and its transfection in different tumour cell 
lines triggers an angiogenic phenotype and promotes tumoxir 
growth. 20 - 29 - 30 - 3 ** 38 A majority of GBMs ( 17/20) also expressed high 
levels of VEGF, as previously reported, with a strong correlation 
between VEGF and MT1-MMP gene expression levels. Double iro- 
munostaining studies showed co-expression of VEGF and MTI- 
MMP by the same rumour cells. These data suggest that the transcrip- 




tional control of VEGF by MTJ-MMP could be operative not only in 
vitro but also in vivo in human GBMs. 

MT1-MMP could also promote the growth of GBMs by its 
ability to activate MMP-2 in the presence of low concentration of 
TIMP-2.- 10 Pro-MMP-2 activation occurs after the formation of a 
ternary complex that contains pro- MMP-2 linked to cell surface 
■MTJ-MMP via a TIMP-2 bridge. Jn accordance with this hypoth- 
esis, we found that MMP-2 activation occurred ii> 8/ 18 of out 
GBMs 20 * 11 among which 7/7 tested for MTJ-MMP showed high 
contents of this protease. 

Activated MMP-9 was also found in 7/18 of our GBMs. This is 
an interesting finding as active MMP-9 is able to mobilize VEGF 
from its ECM reservoir. 39 Therefore, MMPs could promote 
VEGF-mediated angiogenesis in GBMs by both transcriptional 
(MT1-MMP) and post translational (MMP-9) mechanisms. 

VEGF binding to VEGFR-2 triggers the proliferation and mi- 
gration of endothelial cells whereas ils binding to VEGFR-J has 
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^^f? 6 dfetts 0,1 & ob}asl °™ cell lines. Jn our stud ^ V£GF 
mRNA levels were cwrelarcd with VEGFR-2 but not VEGFR-] 
NRPI and sVEGFR-1. Collectively our data suggest that GBMs 
splay ^specific and complex pattern of VEGF receptors, trans- 
ducing VEGF signaling toward cell proliferation and migration 

Jl^^L 001 ^ *** 10 for 311 interplay 

brtween irietalloprotejrases and VEGF in human GBMs as prev^ 
wsjy documenied m experimental tumours. Because of hs daaJ ability 
to activate MMP-2 and to up-iegulate VEGF, MTI-MMP mi «ht be of 
central importance m the growth of human ghoWastomas and repre- 
sent an interesting target for anti-cancer treatments. 
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DNA hypcrraethylatioa is a mechanism for loss of expression of the HLA 
class I genes in human esophageal squamous cell carcinomas. 

£i?*L?£> Xan£jGr, Songjf, ZhatfX, SoC, LiaoJ, WangJLD, YajigJTS. 

Laboratory for Cancer Research, College of Pharmacy, Rutgers-The State University of 
New Jersey, 164 Frehnghuysen Road, Piscataway, NJ 08854-8020, USA. 

The three human leukocyte antigen (HLA) class I antigens; HLA-A, KLA-B and HLA-C 
play important roles in the elimination of transformed cells by cytotoxic T cells Frequent- 
loss of express«on of these antigens at the cell surface has been observed in many human 
cancers. Various mechamsms for post-transcriptional regulation have been proposed and 
tested but the molecular mechanisms for transcriptional regulation are not clear. We show 
by immunohistochemistry that the HLA class I antigens are absent in 26 of 29 (89%) 
samples of human esophageal squamous cell carcinomas (ESCC). Eleven of the 26 ESCC 
samples lost mRNA expression for at least one of the HLA genes, as shown by RT-PCR 
DNA from the 29 pairs of ESCC and neighboring normal epithelium were examined for 
CpG island hypermethylation, homozygous deletion, microsatellite instability (MSI) and 
loss of heterozygosity (LOH). DNA from normal epithelial tissues had no detectable 
r"!wV ati r ° f 016 Cp ° isIands ° f any of the se gene loci. Thirteen of 29 ESCC samples 
(4^) exhibited methylation of one or more of the three HLA loci and six samples (21%) 
exhibited methylation of all three loci. The HLA-B gene locus was most frequently 
methylated (38%). HLA-B mRNA expression in an ESCC cell line, where HLA-B was 
hypermethylated and did not express mRNA, was activated after treatment with 5-aza-2'- 
deoxycytidine. Homozygous deletion of these three gene loci was not observed 
Relatively low rates of LOH and MS I were observed for the microsatellite markers 
. D6S306, D6S258, D6S273 and D6S1666, close to the HLA-A, -B and -C loci, although a 
nigh ratio of LOH was observed at a nearby locus (represented by the markers D6S 1 05 1 
and D6S1560), where the tumor suppressor gene P 21(Wafl) resides. A strong correlation 
between genetic alterations and mRNA inactivation was observed in the ESCC samples 
Our results indicate that HLA class I gene expression was frequently down-regulated in 
ESCC at both the protein and mRNA levels and that hypermethylation of the promoter 
regions of the HLA-A, -B and -C genes is a major mechanism of transcriptional 
inactivation. 

PMID: 11577000 [PubMed - indexed for MEDLINE] 
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Molecular and immunoliistocneinical analysis of HER2/ncu oncogene in 
synovial sarcoma. 

Department of Medicine, Surgery and Dental Sciences, University of Milan AO <5 
Paolo and IRCCS Ospedale Maggiore, Italy. * ' 

^nplification and/or overexpression of HER2/neu have been documented in many types 
of qnthehal tumor and recently has been reported in sarcomas, particularly in 

undTr^TS 1^ r ° le ° f HER2/neU a,terati0nS fa Soft tissue «™ ™™"s poorly 
understood. TTrus the present study investigates the expression of HER2/neu in 1 3 
patterns wtth synov.al sarcoma (SS). In this study, HER2/neu mRNA levels were 
measured m frozen tissue samples using a real-time reverse transcription-polymerase 
ZT^Sr ^ f Qt f * session was assessed by immunobistoehemistry using an 
, J^ ne , U , P ° ,yCl0nal aobbody - Six nomal skeIetal mu scle specimens were used to 
estab hsh basal levels of HER2/neu mRNA. HER2/neu tnmscrip/were deteS i" at 

e^ 

levels above the mean value, whereas 3 tumors (23%) displayed HER2/neu protein 

bvel7^- Tan ^SaZ ^ f ° Und b6tWeen P f0te - session and nW 
lowexri^ « J? HER2/neU mRNA leVCls WCre ^gnificantly associated with a 

lower nsk of developmg recurrences (P » 0,02). Moreover, none of the patients with 

ex D ^rir s e rf rr^, r ,oped u metastasis - our her^ is 

expressed m SSs and that both membrane and cytoplasmic HER2/neu expression 

ffiRtneu^^ ,eVdS - rCSUltS Sh ° W P-nc. of i„3Xds of 

HER2/neu tn SSs is associated with amore favorable clinical course. Further studies are 

Sit Z - J 01 ' ° f ^ ° nCOgene in SSS ™ d to evaluate the application 
tnh bttory humamzed monoclonal antibodies in the treatment regimens for this 
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Molecular and Immunohistochemical 
Analysis of HER2/neu Oncogene in Synovial 
Sarcoma 

PAOLO GIOVANNI NUCIFORO, MD CATERINA PFl l fodimi d r, 
SSSSSW *WKoK^S3 ^PEUEGRINIP*. 



d 0 c«^L^ M,t, ° ^ ^'"P"- 5 "" °f HH«/»e» have b«, 
" mai ' 7 ° rep!,b,Ji:J «*— »<« '««<>, b* b«, 
mS^T r COm "' •T it " l ""y •» ^-c—l Bu, i. of 

»«£ M T T S,nd,, "^"K"" «pr«ioo .f HER V 
HER2/ntm pofycloDal aoobodj. Six norm* i.teuj s „ ci . 

of unn.unos.ao*,,,, were observe, ^ , stroTg co^C^ 
peak incidence in adolescents and you™ adults Thi, 

TZtszz' major r r s - biphLic "SS^oSt 

IR.f , i rt ," ^ogeneucally characterized by the l(X- 
>8Kpll,qll) translocation /found in >95% of cases' 
Although traditionally considered to be a Lh-S 

fcrcn t^;"" n n inVCS,iga,i ° nS haVC 

hf f t > h * en «" c treatment strategies 

.he r^oidy status, and the apoptotic index.' 8 ' 

such LM ^ ° f nCW ■ he ^*"«ic advancements, 

weLTirfh ^ i"*^ 11 " 8 ° f m ° ,cfu)ar alteration 
present ,„ human mahgnancies, has broyght ,o light the 
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found k«w«D pro.eta expression and inRNA feTd (i> = o 0 rt 
;'° Wrmk ^«opi-g reewxences (P = 0 . 02) . Moreover „<, De „f 

need to identify not ordy prognostic factors, but also 
tumor features that are predictive of response to therapy. 

One of the most extensively studied molecular 
T^f H^V/ " HER - 2 ^» pro.ooncogene. 
lhe HER-2/neu oncogene (also known as c-erbB-2) 
located on chromosome 17q21, is a member of the 
tyrosine kinase receptor family and encodes for a 185- 
kjlodalton protein that shows 50% homology with the 
epiderma) growth factor receptor.'-* This gene is am- 
phned.and/or overexpressed in~20% to 30% of breast 
carcinomas ' anrl in vori MW A .i a 



' and in various other tumors, 6 and usually 
_b associated uid, , u „,or aggressiveness and poor prog- 

OTb'o/ ^ S '" dieS haVe SU PP°»^ *e value of 

HtR-2/nen to predict 0,e response to chemotherapy ,n 
breast cancer^rid the use or recombinant humanized 
anubodies to HER-2/neu protein (Trastuzumnb) i„ the 
care of pat.enu with advanced, metastatic breast tumors 
r>as been approved. 

The role of HER-2/„ e u activation in soft tissue 
tumors remains p OOI ly understood, and scarce molec- 
ula, data backing immunohkiorhemica) studies have 
been reported. HER-2/neu pio.ein expression was ,m- 
munoh^ochemically studied in 20<1 sarcomas, indud- 
>ng b S&, and overexpression was absent i„ all these 
malignant mesenchymal neoplasms ,0 

Recently, HEK-2/neu al.e.auons have bee, de- 
scribed in os.eosarcoma, KnU , a high incidence of pro- 
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TABLE 1. CINcopothologic Features of 13 Patients WithSS 



No. 


Age(y)/G 


Site 


Subtype 


Surgery 


Chemo/Radk> 


Rec 


Mets 


HER2 


FU(md) 


1 


68/F 


KI 


MF 


RR 


No 


Yes 


Yes 


L 


108 


2 


7i/y 


KE 


- BF 


WA 


No 


No 


' No 




12 


3 


27/M 


KI 


. BF 


MF. 


y« 


Yes 


Yes 


L 


24 




39/M 


T 


MF 


WE 


Yes 


No 


No 


H 


48 


5 


15/M 


N 


MF,PD 


WE 


Yes 


No 


No 


H 


48 


6 


4)/F 


F 


MFJ>D 


WA 


Yes 


No 


* No 


H 


48. 


. 7 


57/F 


KI 


MF 


VYA * 


No 


N/A 


N/A 


L 


N/A 


8 


48/M . 


F 


BF^D 


ME 


Yes 


Y« 


No 


L 


36 


9 


29/F 


T 


PD 


WE 


Yes 


Yes 


Yes 


L 


36 


10 


27/M 


KE 


BF 


WE 


No 


No 


No 


H 


24 


n 


62/F 


A 


MF 


WE 


No 


Yes 


No 


L 


24 


32 


62/F 


F 


MF 


WA 


No 


No 


No 


h 


2 


13 


34/F 


L 


MF 


RR 


No 


No . 


No 




2 



Abbreviations: Age, age at diagnosis; G,. gender, M„ male; F, female; Site, anatomic location; M, knee, intra-articular; KJ't; knee, 
extra-articular; T, thigh; N, neck; F, foot; A, arm; L, leg; Subtype, histological subtype; MF, monophasic fibrous; BF, biphasic; MF-PD, 
monophastc fibrous with, poorly differentiated areas; BF-PD, biphasic with poorly differentiated areas; PD, poorly differentiated; .IJurgeiy, 
primary surgical therapy; RR, radical en bloc resection; WA, wide through-bone amputation; ME, marginal cn bloc excision; WE^wide en bloc 
excision; Chemo/Radio, adjuvant postoperative chemotherapy and/or radiotherapy; Rec, recurrence; Mets, presence of metastasis; HER2, 
HERS mRNA expression; L, low. expression; H, high expression; FU, follow-iip status; N/A, not avaible. 



tein expression, ranging from 42% lo 61 %. n " 15 Indeed, 
despite limited information on Her2/neu in this type 
of malignancy, based mostly on immunohistochemical 
findings,' 2 clinical trials of Trastu2umab have been 
initiated for recurrent and metastatic osteosarcoma pa- 
tients (http://www.cancer.gov/c)inicaJ_trials: MSKCG- 
99097/NCI-T98-O083 and COCAOST01 2] ) . 

Therefore, we evaluated the mRNA expression and 
the gene product of HER-2/neu in 13 SS patients using 
reatoime reverse transcription-polym erase chain reac- 
tion (RT-PCR) and immuri ©histochemistry. To the best 
of our knowledge, this is the first report documenting 
increased levels of HER-2/neu mRNA and protein 
in SS. 



MATERIALS AND METHODS 

Patient Population 

Thirteen patients with primary SSs, obtained from the 
files of the Department of Pathology, Gaeiano Pini Orthope- 
dic Institute, were included in this study. Cases were chosen 
based on the availability of frozen primary rumor. Patient age 
ranged fiom 15 to 71 years (mean, 44.6 years). Anatomic sites 
included the knee (5 patients, 3 intra-arucular), thigh (2 
patients), foot (3 patients), arm (1 patient), neck (1 patient), 
and }<nw )eg (1 patient). The histoJcgical subry-pes **ere 3 
biphasic (BF), 6 monophasic fibrous (MF), 2 monophasic 
fibrous with poorly differentiated areas (MF-PD), 1 biphasic 
with poorly differentiated areas (BF-PD), and 1 poorly differ- 
entiated (PD). Clinical staging was JIB for all the patients. 
Local surgical excision was performed in 9 pa bents; amputa- 
tion, in 4 patients. Recurrence was observed in 5 cases with 
subsequent amputation (3 patients) and local excision {2 
patients) . Metastasis occurred io lung (2 patients) and ingui- 
nal lymph nodes (] patients). Five patients received adjuvant 
postoperative chemotherapy, and 1 patient (case 4) was 
treated with chemotherapy associated wuh radiotherapy. Fol- 
low-up ranged from 2 to 108 months (mean, 3-1.3 months). 
Clinicopathologic data are summarized in Table 1. 



Pathologic Studies and Frozen Tissue 
Selection 

In all cases, the primary tumor was avaibfole*foi study. 
Surgically resected tumor tissues weTe partly snap-frozen in 
liquid nitrogen and stored at — 80°C for RNA extraction, and 
partly fixed in buffered formalin and embedded in paraffin 
blocks. Hematoxylin and eosin-stained sections were re-«vaK 
uated and graded according to FNCLCG grading system. 16 
Frozen tissue blocks were handled as follows: 4-fHhick frozen 1 
sections were cut and stained with hematoxylin and eosin to 
determine the percentage of tumor cells present *in the spec- 
imen. We used tissue blocks with rumor cells comprising 
more than 80% of the specimen. About 10 20-ji-tbick section's 
were collected into Eppendorf tubes. Another 4-/i-thick fro- 
zen section was cut after the serial sections and examined by 
light microscopy to guarantee the percentage of tumor cells 
collected. Triiol (life Technologies; Gibco BRL, Gaiihers- 
burg, MD) was used for RNA extraction, according to the 
manufacturer's protocol. RNA was quantified spec tr ©photo- 
metrically. . 



cDNA Synthesis 

Total RNA (200 ng) was re verse- transcribed in a total 
volume of 50 /iL containing 1 X TaqMan buffer, 5.5 mmol 
MgCly, 1 mmol deoxynucleotides, 2.5 p-mol random hexam- 
ers, 20 U RNase inhibitor, and 62.5 U MuLv reverse transcrip- 
tase. The samples were incubated at 25°C for 10 minutes, 
48°C for 30 minutes and 95°C for 5 minutes. 



PCP Amplification 

Amplification reactions were performed with the Univer-. 
sai TaqMan 2 X PCR master mix in a volume of 25 jiL con- 
taining 300 nmol of each primer, 100 nmol of probe, and 5 
of cDNA: Both ^acon and HER2/neu amplification were 
done in duplicate for each sample. 

The thermal cycling conditions included 2 minvttes at 
5 OX and 10 minutes at 9b°C r followed by 40 cycles of 95 = C for 
15 seconds and 60°C for 1 minute. All reagents used for 
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RT-PCR were purchased from Applied Biosystems (Foster 
City, CA). 

Primeis ond Probes 

Primers and probes for>acrin and HER2/ncu mRNA 
were chosen using the computer program Primer Express 
(Applied Biosystems). Sequences of the forward primer for 
HER2/neu mRNA (CenBank accession number X03363) 
were 5'-TCC TCT CTG CAC CTG GAT GAC-3* and the 
reverse primer 5'-CCA AAG ACC ACC CCC AAG A3'; the 
sequence of the TaqMan probe was 5* (FAM)-AGC AGA ATG 
CCA ACC ACC GCA CA-(TAMRA)-3\ Sequences of the for- 
ward I primer for £actra mRNA (CenBank accession number. 
X00351) were 5TCC TTC CTG CCC ATG CAGES' and the 
reverse primer 5*-ACG AGG ACC AAT GAT CJT GAT CTT- 
3'; the sequence of the TaqMan probe was 5*(FAM>-CCT GTG 
GCA TCC ACG AAA CTA CCT TG(TAMRA)y. Probes were 
purchased from Applied Biosystems. 

Reol-Time RT-PCR 

To measure HER2/ neu expression in these tumors we 
used a readme quantitative RT-PCR based on TaqMan meth- 
odology, as previously described, 17 with minor modifications. 
Bnefty, this Technique allows, by means of fluorescence emis- 
sion, to find the cycling point when PCR product is delectable 
(Ct value or threshold cycle). As previously reported, the Ct 
value correlates to the starting quantity of the target mRNA. ia 
To normalize the amount of total RNA present in each reac- 
tion, we amplified the housekeeping gene £actin, which is 
assumed to be constant in both norma) samples and tumor 
tissues. 

Our results are expressed as relative levels of HER2/neu 
mRNA, referred to a sample, called a "calibrator,- chosen to 
represent IX expression of this gene. The calibrator was a 
breast cancer cellular line (MCK7) 39 that was analyzed on 
every assay plate with the unknown samples. AH of the ana- 
lyzed tumors expressed n-fold HER2/neu mRNA relative to 
the calibrator. 

The amount of target, normalized to an endogenous 
reference (0-actin) and relative to the calibrator, was defined- 
by the AACt method as described, by Livak K. (Sequence 
Detector User Bulletin 2; Applied Biosystems). Specifically 
the formula is applied as follows: 



-aao 



target amount 

where AA C. = {[Ct (HER2/neu sample) - Ct Vacrin 
sample)! ~ [Ct (HER2/neu calibrator) - Ct (^ctir, calibra- 
tor))!. 

Immunohistochemistry 

Formalin -fixed, paraffin-embedded tissue sections were 
deparaffimzed.Tehydmtcd, and exposed .o the primary anti- 
body using the E»Vision + system (Dako, Carpinieria CA) 
Primary ami_HER-2/neu antibody (rabbit polyclonal anii^ 

r 1? ,OE nUmber A0485; Dako > ra a PP^ m a dihinon 
of J.^OOOfor 60 mmutes at ioom tempersrure. Before expo- 
sure to the primary antibody, sections were mioowave-pre- 
.rea.cd ,„ EDTA, pH 8.0, to -retrieve antigenicity, and incu- 
bated wuh endogenous peroxidasc-blocking solurion for 10 
rmnutesat room temperature. Positive control, constituted bv 
? ,?? " TCinoma lowing mote than 80% positive starmn K 
for HtR2/nru, as well as negative control, in which .he 
primary antibody was omitted, weie stained in parallel. . 




• normal issue n=6 synovttl sarcomas symki sarcqne& 
(tow eipressioh) (hfch expression^ 
n=S. - r&4 

FIGURE 1. Dislribulfon of HEP2/neu mRNA levels in normal 
muscle tissues, ond in low- and high-expression sarcomas. 
Dcrto are expressed as mean and standard error of the mean 
for each group. ■ - 



All cases were examined for both cytoplasmic and mem- 
brane immunoreactivity. Cytoplasmic staining was evaluated 
on a semiquantitative scale, according to Klpatrick et al with 
minor modifications, 20 and reported as 0 (no staining or 
staining in <10& of cells), 1+ (weak staining in >10% of 
cells), 2+ (moderate staining in >10% of celb), or 3+ 
(strong staining in >J0% of celb). The presence of a mem- 
branous pattern of staining was recorded separately and 
scored as absent, (no staining or weal staining in < 10% of . 
cells) or present (complete and/or incomplete staining in 
>10% of cells). Tumors with a cytoplasmic score of 34 were 
considered to have high HER-2/neu protein expression. 

StotiSiico! Anolysis 

Starisiical differences were calculated by fisher's exact 
test. The Mest method was used to evaluate the differences 
between groups. Differences were considered statistically sig- 
nificant when Pwas <0.05. 



RESULTS 

HER2/neu mRNA Evoluolton 

All of the tissues analyzed contained delectable 
)eve)s of HEJ&neu mRNA Svt norma) tissue samples 
(skeletal muscle) were used to establish basaj level of 
HER2/neu mRNA. AH the normal samples expressed 
very low levels of HER2/neu mRNA, ranging from 0.9 
to 1.9 n (mean, 1.4 n). Among the 13 tumor samples, 
HER2/neu levels varied greatly, ranging from 2.1 to 
24 n. Setting a cutoff level at 7.9 n (a value that repre- 
sents the mean value of expression distribution of the 
SSs), 9 cases (69%) had low HER2/neu expression and 
4 cases (31%) had high HER2/neu expression (Fig 1; 
Table 1). The difference between the 2 groups (lo^v 
and high JTER2/neu tumors) was statistically significant 
(P = 0.0004). 
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FIGURE 2. Immunohisiochemtcol tocolfeotion ol HER?/neu in SS. (A) Posith/e control (breost cancer) showing typical strong 
membrane pattern or postlivity. (B) Cose 7, o biphasic S5. displaying very local membrane staining limited too gland. Inset toccd 
weak cytoplasmic posilivity in the epithelial component ot the same cose. (Q Cose 10, the epithelial component of this biphasic 
SS, displaying strong membrane pattern ot staining. (D) Case 4, an epithelioid area in a monophoslc fibrous SS. showing strong 
cytoplasmic positively. 



HER2/neu Pr'olein Expression 

Staining with HER?/neu antibody revealed a vari- 
able cytoplasmic and membrane staining pattern. 
Three. tumors (23%) showed strong staining involving 
both the cell membrane and the cytoplasm (cases 4, 5, 
and 10); weak to moderate, exclusively cytoplasmic 
staining was observed in 7 cases (cases 1, 3, 6, 7, 8, 9, 
and ] 1). No staining was detected in 2 tumors (cases 12 
and 13). Jn 1 case (case 2), a cluster ol glandular 
structures representing <5% of the rumor showed 
weak cytoplasmic and very focal membrane staining, 
the latter limited to a single gland. The epithelial/ 
epithelioid components exhibited stronger cytoplasmic 
staining compared with die spindle-cell component of 
the tumors. Membrane staining was predominantly in- 
complete and limited to the epitheiial/epithelioid ar- 
eas. All 3 cases with high immunohistochemical expres- 
sion of HER2/ncu (cases <\, 5, and 10) were grade Ul 
sarcomas, including ] MT r 1 JJF, and ] _\JF-PD SSs. 



Examples of HER2/neii icyjbprasmic'ahcl membrane 
staining are depicted in Figure 2. 

Correlation of Molecular and 
Immunohistochemico! Results 

A strong, statistically significant association was 
present between piotein expression, for both mem- 
brane and cytoplasmic staining, and HER2/neu rnRNA 
levels {P - 0.01 ), aJdrough 1 case (case 6) displayed 
discordant results. Interestingly, this neoplasm showed 
high } lER2/neu mRNA levels, whereas only weak stain- 
ing, limited to the cytoplasm of a minority of tumor 
cells, was detected by immunohistochernicaJ analysis. 

HER2/heu Expression ond Clinicopolhologic 
Porometers 

Both HER2/neu protein expression and mRNA 
levels were evaluated to establish the relationships to 
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TABLE 2. Cmetation Between Clinjcopathotogk: 
Featu/es and HER2/neu Expression as Detected by 
IHC and RT-PCR 



HER2/ncu 



IHC 



PGR 



Variable 



lvalue 



H PvaJuc 



3 3 3 3 

7 0 NS 6 1 NS 

8 0 7 1 

2 3 0.03 2 3 NS 

3 2 3 2 

7 I NS 6 2 NS 



Age (years) 
<40 
>40 
Sex 
Female 
Male 

Tumor size (cm) 

<5 
>5 

Histological grade 

n 3 0 3 0 

J" - 7 3 NS 6 4 NS 

Histological type 

BF . 2 1 2 3 

PD 3 J NS 2 2 NS 

Cheroo/Radiotherapy* 

Yes .42 .33 

™° 5 1 NS 5 1 NS 
Recurrence! 

Yes - 5 0 5 0 

No 2 3 NS 1.4 0.02 
' Metastasis! 

Yes 3 0' 30 

™ _ ""' 4 3 NS 3 4 NS 



■ Abbreviations: U low expression; H, high expression; NS, not 
significant; MF, monophasic fibrous; BF, biphasic; PD, poorly differ- 
entiated (including MF and BF *ith poorly differentiated areas). 

Information not available for case 7. 
t Cases 7, 12, and 13 were excluded from the analysis. 

clinicopathologic features, including local recurrence 
and metastatic disease. Two cases (cases ] 2 and 13) with 
follow-up less than 12 months and 1 case (case 7) for 
which clinical information was not available were ex- 
cluded from the analysis of recurrences and metastatic 
behavior. 

No correlation was observed between HER2/neu 
mRNA expression and age, sex, tumor size, tumor 
grade, histotype, and metastasis. A correlation between 
sex of the patients and HER2/neu protein expression 
was found. Jn fact, none or the female patients showed 
high HER2/nen protein expression (P = 0.03). Pa- 
tients with high Her2/neu mRNA levels had a lower 
risk of recurrence - than those with low Her2/neu 
mRNA levels {P = 0.02). None of the cases with high 
HEK2/ne\i mRNA levels developed metastatic foci, al- 
though the small number of observations precluded 
reaching statistical significance (P = 0.1). Results are 
detailed in Table 2. 

DISCUSSION 

The present work provides the first combined mo- 
lecular by real-ume RT-PCR and mimunohisiochernical 
evidence thai HER2/neu over expression occurs in SSs 



Our results indicate that this parameter may provide 
prognostic Information and suggest that a specific ther- 
apy with humanized monoclonal antibodies against 
HER2/neu may be considered in a significant number 
ofSSs. 

The HER2/neu oncogene has been ; extensively 
investigated as, a prognostic factor and more recently as 
a predictor of response to therapy. It has been demon- 
strated in breast cancer, where HER2/neu overexpres- . 
sion is usually associated with gene amplification, and 
in other epithelial tumors, mcluding ovarian, gastric, 
lung, and urinary bladder carcinomas. 

HER-2/ n eu amplification/overexpression appears 
to be an early event in oncogenic transformation by 
interacting with other members of the HER family/ In 
breast cancer, it is involved in cell cycle and apoptotic 
pathways through the antiapoptotic effects mediated by 
p53 and p21 deregulation. 

Whether HER2/neu overexpressioh plays an im- 
portant role in mesenchymal neoplasms remains con- 
troversial. An imrriunohistochemical study of sarcomas, 
using a monoclonal antibody, reported no evidence of 
immunoreactivity for HER-2/neu in 6 SSs as well as in 
other 197 mesenchymal tumors, with cytoplasmic reac- 
tivity observed only in 1 case of peripheral neuroepi- 
thelioma. A recent investigation reported gene ex- 
pression profiles of 41 soft tissue tumors with cDNA 
microarray analysis. Among these sarcomas, 6 
monophasic SSs were characterized by a unique expres- 
sion pattern of a cluster of 104 genes, including the 
epidermal growth factor receptor, which shows 50% 
homology with the HER2/ neu gene; 24 These data also 
suggest that the erb-B receptor family plays a significant 
role in SS. It has been demonstrated that a variable 
number of osteosarcomas overexpress HER2/neu. 11 ' 15 
However, more recent studies 20 '* 5 - 26 were unable to 
detect any HER2/ neu gene amplification and/or over- 
expression using fluorescence in situ hybridization 
(FISH), RT-PCR, and immun ©histochemistry. 

Differences in the techniques used may play an 
important role and explain (at least in part) these 
discrepancies. HER2/neu alterations can be evaluated 
using different techniques inxluding irruriuno histo- 
chemistry, FISH, Southern hybridization, Northern 
blot, and competitive, differential, or real-time RCR. 27 
Irmnun ©histochemistry is the most common method 
for detection of HER2/neu overexpression, but it is 
significandy affected by the sensitivity and specificity of 
the antibodies used, the type of tissue (frozen versus 
formalin-fixed), and the various interpretative criteria 
and scoring systems used lo evaluate cases. Indeed, 
most studies of H£R2/neu expression in osteosarcoma 
used immunobistochemical techniques, with different 
monoclonal or polyclonal antibodies. The discrepancy 
in results may stem from the use of different antibodies, 
as well as a lack of standardized evaluation. 

For these reasons, to evaluate H£R2/neu immu- 
noreactivity in our study,. we used a polyclonal antibody 
(Dako, Carpinteria, CA), arguably the most diffuse and 
thoroughly tested antibody for HER2/neu assessment. 
Furihermore, we investigated HER2/nen mRNA ex- 
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pression with real-time RT : PCR, because it has been 
demonstrated that mRNA levels correlates tightly with 
protein expression. 28 At present, real-rime RT-PCR 
probably represents the most powerful tool for quanti- 
tative analysis, because it allows better internal control 
and reduction of sample contamination, and provides 
more objective results. 18 

We analyzed HER2/neu gene expression at the 
mRNA and protein level in 13 cases of SS. HER2/neu 
expression was found in all of the cases investigated, 
and mRNA content in the tumors varied from 2.1 to 
24 n. 

The variability of mRNA levels in SSs is reflected 
on heterogeneity of protein expression pattern as de- 
tected by irrmiunohistochemistry. We found that 
HER2/neu immunoreactivity correlates strongly with 
mRNA levels. A convincing cytoplasmic immunoreac- 
tivity was documented with the polyclonal antibody in 
10 of 13 sarcomas. Distinct membranous staining was 
observed in 3 cases, although it was never comparable 
to the positive breast control. It was predominantly 
incomplete and identified in the epithelial/ epithelioid . 
component of SS. Interestingly, all of the cases with 
strong cytoplasmic staining also exhibited a merribrane- 
staining pattern. 

In breast cancers, a membranous pattern, of stain- 
ing is thought to be specific for HER2/neu protein 
expression and correlated with gene amplification, 
whereas cytoplasmic staining is usually considered non- 
specific. 29 However, cytoplasmic positivity for HER2/ 
neu has been reported to be prognostically significant 
in other tumor types^ including bladder, colon, pan- 
creas, thyroid, and nasopharyngeal carcinomas, and 
even in breast cancer. 3 * 3 * 

Patients with high HER2/neu mRNA expression 
had a significantly lower risk of recurrence. Similarly, 
all of the cases with high HER2/neu expression did not 
metastasize, although this correlation did not reach 
statistical significance due to the small number of cases 
studied. These data suggest that HER2/neu plays a role 
in the biology of SS and that HER2/neu overexpression 
may be linked to a less-aggressive clinical behavior. 
Indeed, unlike many cancers where HER2/neu overex- 
pression has been shown to correlate with poor prog- 
nosis, HER2/neu levels are linked to a more favorable 
clinical course in other malignant tumors, such as thy- 
roid carcinoma and osteosarcoma. 1534 

The moleculaj mechanisms responsible for the 
action of HER-2/neu in SSs are unknown. A possible 
interaction between HER-2/neu and the other mem- 
bers of HER family could be important in tumorigen- 
esis. Derangements of other oncogenes, tumor suppres- 
sors, and apoptosis regulators have been described in 
SSs. For instance, many SSs have been shown to be 
diffusely positive for bcl-2 family proteins (bcl-2, bax, 
bcl-x, and bak). "These members of the bcl-2 family are 
involved in the regulation of apoptosis in SS. 36 This 
raises the hypothesis thai complex alterations in apop- 
tosis-controlling mechanisms are present in these neo- 
plasms, with HER-2/neu interacting with Bcl-2 farnily 



members. Further studies . are needed to clarify the 
mechanisms of apoptosis in SS. 

Depending on the size and location, the therapy of 
choice for SS is radical local excision or amputation.. 
Whenever radical surgery cannot be performed, radio- 
therapy in concert with local excision is suggested in an 
attempt to avoid the need for amputation. Only re- 
cently has a study evaluated the possible role of chemo- 
therapy in the treatment of SS. 

The role of molecular markers in predicting treat- 
ment responsiveness is currently the focus of extensive 
investigation. Breast cancer patients with high HER2/ 
neu expression appeared to benefit from high-dose 
CAF (cyclophophamide, adriamydn, and 5-fHuoroura- 
cil) therapy. 38 In our study, 3 of 4 patients ivith high 
HER2/neu expression received adjuvant chemother- 
apy with ifosfamide; these patients had a favoiable clin- 
ical outcome. These data raise the possibility that 
HER2/heu may have value in predicting.which patients 
are likely to respond to specific adjuvant chemotherapy 
regimens. Whether the favorable significance of 
HER2/neu expression depends on predicting clinical 
recurrence, response to chemotherapy, or both re- 
mains to be fully elucidated in SS patients. • 

To the best of our knowledge, this is the first 
report that shows expression of HER2/neu in primary 
SS by real-time RT-PCR. Elevated levels of HER2/neu 
rnRNA and protein are found in a significant group of 
SS patients, and these levels appear to correlate with 
features of good prognosis. Furthermore, our results 
suggest that this mechanism of disease in SS may be the 
target of specific inhibitory therapies based on human- 
ized monoclonal antibodies. Considering the small 
number of patients examined, further investigation is 
needed to confirm these preliminary findings. 
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Gain and loss of chromosomal material is characteristic of bladder cancer as well as 
maUgnant transformat.on in general. The consequences of these changes at both the 
ranscnption and translation levels is at present unknown partly because of technical 
hm.tat.ons. Here we have attempted to address this question in pairs of non-invasive and 
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oS^ t,V ^ 8en ? m ^ bridiZati0n ' Wgh denS,ty o'igPnuclobtide array-based monitoring 
Of transcnpt levels (5600 genes), and. high resolution two-dimensional gel 
e ectrophores.s. The results showed that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This effect depended (p < 0 01 5) on the 
magnitude of the comparative genomic hybridization change. In general (18 of 23 cases> 

°nM S r al ^ With m ° re 013,1 2 - f0ld S ain of DNA sh< >^ * corresponding increase 
n mRNA ^anscnpts. Areas with loss of DNA, on the other hand, showed eithef r"eS 
or unaltered transcnpt levels. Because most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and protein alterations in relatively 
few cases of well focused abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and protein levels. The implications 
as well as I.m.tations, of the approach are discussed. mpncations, 

PMfD: 12096139 [PubMed - indexed for MEDLINE] 



Research 



Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
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Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef-. 
feet depended (p < 0.015} on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be-, 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proleins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 

Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
■ have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1, 
emsf, and N-myc (3t5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping , fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11p-, 1q+, 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome- wide tech- 
nology for detecting genomic gains and losses (CGH) with, 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non- invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary). 



1 The abbreviations used are: CGH, comparative genomic hybrid : 
ization; TCC, transitional cell carcinoma; LOH, loss ol heterozygosity; 
PA-FABP, psoriasis-associated latty acid-binding protein; 2D, 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. B, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
(/err). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical fine proken) indicates a ratio value of 1 {no change), and the vertical lines next to ft (doffed) indicate a ratio of 
0.5 (/eft) and 2.0 (r/gnf). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome is shown to the right of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDL.DK/sdata.htmi). The bars indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; >2-fold 
increase {black). >2-!old decrease (6/ue). no significant change [orange). The bar to the far right, entitled Expression shows the resulting change . 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated p/ack), at least half of the genes 
down-regulated [blue), or more than half of the genes are unchanged {orange). If a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2-lold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade t and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade HI. 

mRNA Preparation—Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium- guanidinium thiocyanate 
solution and stored at -80°C. Total RNA was isolated using the 
. RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poty(A)* RNA was isolated by an oligo(dT) selection step (Oligotex 
. mRNA kit; Oiagen). 

CRNA Preparation— t ^g of mRNA was used as starting material. 
The .first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
tured s instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambion). Biot in- labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Oiagen). 

Array Hybridization and Scanning— Array hybridization and scan* 
ning was modified from a previous method (13). 10 ^9 erf cRNA was 
fragmented at 94 *C for 35 min in buffer containing 40 mM Tris 
acetate. pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequentry cooled to 40 °C ( and loaded onto the Asymetrix' probe 
array cartridge. The probe array was then incubated for 16 h at 40 e C. 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 °C followed by 4 washes in 0.5 X SSPE-T 
at 50 °C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 10 iig/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1— continued 



for 30 min at 25 *C IpHowed by 1 0 washes in 6.x SSPE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope {made (or Affymetrix by Hewlett-Packard). The readings 
Irorn the quantitative scanning were analyzed by Asymetrix gene 
expression analysis software. 

Microsaleltite Analysis— Micros a tellite Analysis was performed as 
described previously (14). Microsate titles were selected by use of 
www.ncbi.nlm.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 tor 35 
cycles. The ampficons were denatured and electrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele delected in tumor ampficons compared with blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure tor 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were identified by a combination of procedures that included 
microsequencing, mass spectrometry, two- dimensional gel Western 
tmmunoblotting, and comparison with the master I wo- dimensional gel 
image ol human keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH- Hybridization of differentially labeled tumor and normal DNA 
. to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein- labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 jig) were 
denatured at 37 °C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were counter stained with 0.15 /ig/ml 4,6-diamidino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein- labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
green: red fluorescence intensity ratio profiles were calculated using 
tour images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization— The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 
Top, CGH used as independent variable 0* CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable {if expression aJteration - what CGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulation 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down- regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



80% 



17% 



Expression change clusters 


Tumor 733 vs. 335 


Concordance 


Expression change clusters 


Tumor 827 vs. 532 


Concordance 


CGH alterations 


CGH alterations 


16 Up-regulalion 


1 1 Gain 


69% 


17 Up-regulation 


10 Gain 


59% 


2 Loss 






5 Loss . 






3 No change 






2 No change 




21 Down-regulation 


1 Gain 


38% 


9 Down-regulation 


OGain 


33% 


8 Loss 






3 Loss 






12 No change 






6 No change 




15 No change 


3 Gain 


60% 


21 No change 


1 Gain 


81% 


3 Loss 






3 Loss 






9 No change 






17 No change 





two invasive tumors (stage pT1, TCCs 733 and 827), whereas 
. the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+; 9q-, 
and respectively. Both invasive tumors showed changes 
(1q22-24 + , 2q14.1-qter-, 3q12-q13.3-, 6q12-q22~, 
9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
, some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-foId for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. ^A) and 
20q12 in TCC 827 (Fig. 1B). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1 ,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way il was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non- invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal abenations are not known at high resolution. 

. Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
. chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an Increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table I, bottom)- Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area dearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2). For both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the QNA copy number) and alterations detected by the array 
based technology (Fig. 2). Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-oft point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
. much larger extent. 

Microsatellite- based Detection of Minor Areas of Loss : 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21 t and 7q21*and q32, 9q34. and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- . 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression ot mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21,1, and 16q24 
and in TCC 827 at chromosome 11pl5.5, 12p11, 15q11.2, 
and I8q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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FtG. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), (o) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close . 
to general /3-speclrin (gene number 1 1 on Ftg. 1) and of (d) tumor 827 
showing loss of heterozygosity at chromosome 18q12byS18S1118 
close to mitochondrial 3-oxoacyl -coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
. from normal DNA from leukocytes (A/), and the lower curves show the 
electropherogram from tumor DNA (7). In alt cases one allele is 
partially lost in the tumor amplicon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels — 
2D-PAGE analysis, in combination with Ccomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. . Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene {vertical axis). A, mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive turriors (along horizontal axis) or 
as absent in non- invasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a contounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 {*Q were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
left), phosphoglucomutase 1 , glutathione transferase class p. number 
4, fatty acid-binding protein homologue, cytokeratin 15, and cyto- 
keratin 13; B (from /eft), fatty acid- binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; C (from left), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-t, . and 
pre-mRNA splicing factor; D, mesotheltal keratin K7 (type II); E (Irom 
fop), glutathione S-transf erase-™ and mesothelial keratin K7.(type II); 
F (from fop arid /eft), adenytyl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phospboglycerate mutase, annexin IV, cy- 
toskeletal 7-actiri, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain- type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationalry controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 0-1 subunit; G, (from top and /eft). TCP20, calgizzarin, 70- 
kDa heat shoe* protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 1 9, triosephosphate isomerase, hnRNP F r liver gryc- 
eraldehyde-3- phosphatase dehydrogenase, glutathione S-transfer- 
ase~ff, and keratin 8; H (Irom /eft), plasma gelsolin, autoantigen cal- 
reticulin, thioredoxin, and NAD+Tdependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase ^-subunit, cyto- 
keralin 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; JL (from fop and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunit, o-enolase, GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 



42 Molecular & Cellular Proteomics 1, 1 



Gene Copy Numbers, Transcripts, and Protein Levels 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 1 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4), For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733;. see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). . 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene, expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Table u 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration" 


Protein alteration 


Annexih II 


1q21 


733 


Gain 


Abs !o Pres* 


, Increase 


Annexin IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cylokeratin 17 


17q12-q21 


827 


Gain 


3.8- Fold up 


Increase 


Cytokeralin 20 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10- Fold down 


Decrease 


FBP1 


. 9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibit in 


17q21 


827/733 


Gain 


3.7-/2-5-Fold up b 


Increase 


Prolyl- 4 -hydroxyl 


I7q25 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


hnRNPBI _ 


7p15 


827 


Loss 


2.5-Fbld down 


Decrease 



* Abs, absent; Pres, present. 

b In cases where the corresponding alterations were tound in both TCCs 827 and 733 these are shown as 827/733. 




FrG. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper part of the figure shows a 2D gel 
(/eft) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearty, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827 (red annofafionj. The' tile on the array containing probes for 
cylokeratin .15 is enlarged betoiv the array {fed arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC. 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cylokeratin 13, a high 
transcript levef was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom o\ the figure (/eft) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regulated "m the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detecled in TCC 733 (166 units). IEF t isoelectric focusing. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these, transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (eg. in TCC 733 transcript from cellular ligand of 
annexin » gene (chromosome 1q21) from absent .to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription; it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may He in the loss of controlled 
methylatron in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be.demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
. off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3tumors. InpTa tumors, these include 9p-;9q-, 1q+. Y- 
(2, 6), and in pTI tumors. 2q-,11p-, 11q-> 1q+, 5p+. 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33~ and 
9q- and respectively. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 11q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases, it seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal celts and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkabie 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translation^ 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D- PAGE (25). and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 

Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of .chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number. mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger-number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived.from genes with 
known locations, and 2D gel analysis . to obtain information at 
the posHranslational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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Expression of bcr-abl mRNA in individual chronic myelogenous leukaemia 
cells as determined by in situ amplification. 

Pachmann IC Zhao S , SchcnkT . Kantarjian H . El-Naggar AtC Siciliano MX Guo 
JO, Arlinghaus RB . Andreeff M . 

The University of Texas M.D. Anderson Cancer Center, Department of Molecular 
Haematology and Therapy, 1515 Holcombe Boulevard, Houston, TX 77030, USA. 

We present the results of a novel method developed for evaluation of in situ 
amplification, a molecular genetic method at the cellular level. Reverse transcription 
polymerase chain reaction (RT-PCR) was used to study bcr-abl transcript levels in 
individual cells from patients with chronic myelogenous leukaemia (CML). After 
hybridizing a fluorochrome-labelled probe to the cell-bound RT-PCR product, bcr-abl 
raRNA-positive cells were determined using image analysis. A dilution series of bcr-abl- 
positive BV173 into normal cells showed a good correlation between expected and actual 
values, hi 25 CML samples, the percentage of in situ PCR-positive cells showed an 
excellent correlation with cytogenetic results (r = 0.94, P < 0:0001), interphase 
fluorescence in situ hybridization (FISH) (r = 0.95, P = 0.001) and hypermetaphase FfSH 
(r- 0.81, P < 0.001). The fluorescence intensity was higher in residual CML cells after 
interferon ([FN) treatment than in newly diagnosed patients (P = 0.004), and was highest 
in late-stage CML resistant to IFN therapy and lowest in CML blast crisis (P = 0.00 1). 
Mean fluorescence values correlated with bcr-abl protein levels, as determined by 
Western blot analysis (r = 0.62). Laser scanning cytometry allowing automated analysis 
of large numbers of cells confirmed the results. Thus, fluorescence in situ PCR provides a 
novel and quantitative approach for monitoring tumour load and bcr-abl transcript levels 
in CML. 
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Cell localization and regulation of expression of cytochrome P450 1A1 and 

tvlZ£< r g - " mdUCti ° n Wlth 3 - met MchoIanthrene using mRNA 
hybridization and immunohistochemistry. 

INSERM Unite 139, Hqpital Henri Mondor, Creteil, France. 

Im^V h ^POnse of various pulmonary cell types to polycyclic 

aroma uc hydrocarbons the expression of cytochrome P450 (GYP) 1 Al and 2B IrnRNA 
«n the lung of rats, w.th or without induction by 3-memylcholanthrene (3MC)wa7^ 
Sn™ y fJT h ^ ndiz f on u ^g appropriate 35S-labeled riboproS Th^ 
expression of the corresponding proteins was investigated irnmunohistochemi4y 

mSST- ^ pneumocytes and venous endothelial cells. In Clara cells 

mRNA expression was detected as early as 1 h after induction, peaked between 2and 4 h 
and was completely undetectable at 14 h. In contrast, venousSeliaS^e H 
pneumocytes exhibited permanent mRNA expression of CYP 1 Al ini^Xi^ 
ra^ These Wic results explain the striking absence of correlation bZS Ld 

SSS" CCl,S 24 h 3fter of the inductirp^I^ 

ttiese cells exhibited intense protein expression with no mRNA. In contrast, a good 

ZfhnTlZ \ \ Md ^ eFI P neumoc ^ b« no expression in endothelial 
cells. This study clearly distinguished the regulation of CYP 1 Al exoression intZL 
lungfrom that d^cribed in the liver. The difWes obse^ 

mSTnerf^ inV0,Ved ' -phasizeSS 

must be performed at the cellular level in order to understand the specific response to 
xenob,ot,cs, not only of this organ as a whole but also of its variouLalomTc s^res. 
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Correlative iuunuuouistoclieiuical and reverse transcriptase polymerase 
chain reaction analysis of somatostatin receptor type 2 in neuroendocrine 
tumors of the lung. 

Papotli M , Croce S , Macri L , Funaro A , Pccchioni C , Schindler M , Dnssotati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Italy. 

Somatostatin receptors type 2 (sst2) have been frequently detected in neuroendocrine 
rumors and bind somatostatin analogues, such as octreotide, with high affinity. Receptor 
autoradiography, specific mRNA detection and, more recently, anttsst2 polyclonal 
antibodies are currently employed to reveal sst2. The aim of the present study was to 
investigate by three different techniques the presence of sst2 in a series of 26 
neuroendocrine tumors of the lung in which fresh frozen tissue and paraffin sections were 
available. It was possible, therefore, to compare, in individual cases, RNA analysis 
studied by reverse transcriptase polymerase chain reaction (RT-PCR), in situ 
hybridization (ISH), and immunohistochemistry. A series of 20 nonneuro endocrine lung 
carcinoma samples served as controls. RT-PCR was positive for sst2 in 22 of 26 samples, 
including 15 of 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 small-cell 
carcinomas. The sst2 mRNA signal obtained by RT-PCR was strong in the majority 
(87%) of typical carcinoids and of variable intensity in atypical carcinoids and small-cell 
carcinomas. A weakly positive signal was observed in 5 of 20 control samples. In 
immunohistochemistry, two different antibodies (ariti-sst2) were employed, including a 
monoclonal antibody, generated in the Department of Pathology, University of Turin. In 
the majority of samples a good correlation between sst2 mRNA (as detected by RT-PCR) 
and sst2 protein expression (as detected by immunohistochemistry) was observed. 
However, one atypical carcinoid and one small -eel I carcinoma had focal immunostaining 
but no RT-PCR signal. ISH performed in selected samples paralleled the results obtained 
with the other techniques. A low sst2 expression was associated with high grade 
neuroendocrine tumors and with aggressive behavior. It is concluded that I ) 
neuroendocrine tumors of the lung express sst2, and there is a correlation between the 
mRNA amount and the degree of differentiation; 2) immunohistochemistry and ISH are 
reliable tools to demonstrate sst2 in these tumors; and 3) sst2 identification in tissue 
sections may provide information on the diagnostic or therapeutic usefulness of 
somatostatin analogues in individual patients with neuroendocrine tumors. 
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Correlative Immunohistochemical and Reverse 
Transcriptase Polymerase Chain Reaction Analysis of 
Somatostatin Receptor Type 2 in Neuroendocrine 
Tumors of the Lung 



" Aola Funafo, ph jx, Carla Peccto'oiu\ Marcus Schindler, M.D., and 
Gianni BDSsolati, M-D., F.R.c.Path. 



Somatostatin receptors type 2 (sst2) have been frequently de- 
tected in neuroendocrine tumors aod bind somatostatin ana- 
logues, such as octreotide, with high affinity. Receptor autora- 
diography, specific mRNA detection and, more recently, anri- 
ssi2 polyclonal antibodies are currently, employed to reveal 
: ssl2r The aim of the present study was to investigate by three 
■ different techniques the presence of sst2 in a series of 26 neu- 
roendocrine rumors of the lung in which fiesh frozen tissue and 
paraffin sections were available. It was possible, therefore, to 
compare, in individual cases, RNA analysis studied by reverse 
transcriptase polymerase chain reaction (RT-PCR), in siru hy- 
bridization (JSH), and rmrounohistochemistjy. A scries of 20 
nonneurbendocrine lung carcinoma samples served as controls, 
RT-PCR was positive for sst2 in 22 of 26 samples, including 15 
of. 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 
small-cell carcinomas. The sst2.mRNA signal obtained by RT- 
PCR was strong in the majority (87%) of typical carcinoids and 
of variable intensity in atypical carcinoids and small-cell car- 
cinomas. A weakly positive signal was observed in 5 of 20 
control samples. Jn iroroun ©histochemistry; two different anti- 
bodies (anti-sst2) were employed, including a monoclonal an- 
tibody, generated in the Department of Pathology, Univeisity 
of Turia In the majority of samples a good correlation beiween 
sst2 mRNA (as detecied by RT-PCR) and sst2 protein expres- 
sion (as detected by immunohisiochemisoy) was observed. 
However, one atypical carcinoid and one small-cell carcinoma 
had focal immunostaining but no RT-PCR signal: ISH per- 
formed in selected samples paralleled the results obtained, with 
the other techniques. A low sst2 expression was associated with 
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high grade neuroendocrine tumors and with aggressive behav- 
ior. Jt is concluded that 1) neuroendocrine rumors of the lung 
express sst2, and there is a correlation between the mRNA 
amount and the degree of differentiation; 2) irnmunohrstochem- 
istry and JSH are reliable tools to demonstrate sst2 in these 
rumors; and 3) sst2 identification in tissue sections may provide 
information on the diagnostic or therapeutic usefulness of so- 
matostatin analogues in individual patients with neuroendo- 
crine rumors. ■ . 
Key Words: Neuroendocrine— Lung— Tumors— Somato- 
statin receptors — ImrounohistochemistTy^-Small cell carci- 
noma—Reverse transcriptase polymerase chain reaction. 

Diagn Mol Pathol 9(1): 47-57, 2000. 



The somatostatin receptor family (sst) includes at least 
jive isofbrms that have been recently identified and char- 
acterized (18,32,41). The ssts are widely distributed in 
normal human tissues and m human turners. Sst type 2 is 
more commonly detected in neuroendocrine rumors 
(32,37) and binds the somatostatin analogue octreotide 
with high affinity. 

Sst localization had originally been- demonstrated by 
means of binding assays of radiolabeled somatostatin 
analogues (20,25,31). Subsequently, specific sst messen- 
ger KNA (mRNA) detection was obtained by means of in 
situ hybridization (ISH) and reverse transcriptase poly- 
merase chain reaction .(RT-PCR) (14,32,37). Recently; 
polyclonal antibodies specific for different isoforms of 
sst were produced and used in kruTmnohisicxrhemisn-y 
(10,12,15,1830,35,36). Given the well-known heteroge- 
neity of neoplastic populations, in situ methods (iramo- 
nohisiochemistry and ISH) allow a more definite map- 
ping of die distribution of the receptor in such tissues. 
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This is potentiaDy useful for predicting the responsive- 
ness of a given neoplastic cell population to medical 
treatment with somatostatin analogues, which are used in 
the clinical setting for both diagnostic and therapeutic pur- 
poses with special reference to neuroendocrine tumors- 

The spectrum of neuroendocrine rumors of the lung 
includes well-differentiated neoplasms (so-called typical 
carcinoids) and poorly differentiated small- cell carcino- 
mas (SCCs). Intermediate forms sharing features of both 
the aforementioned types also belong to this spectrum 
(so-called atypical carcinoids or well-differentiated neu- 
roendocrine carcinomas). Finally, large-cell neuroendo- 
— Grine-caTeinoma-has-been4dentified and included-is-tbis- 
tumor group (4,40). The tissue distribution of sst2 in 
neuroendocrine tumors of the lung has not been thor- 
oughly characterized, although individual samples of 

: bronchial carcinoids were found to express sst (30)1 
SCCs (but not non-small-cell types) were also shown to 
be sst2 positive by receptor binding assay (33). More- 
over, sst2 has been detected in in vitro cell cultures of 
human SSC of the lung (39,42)- No study on a series of 
neuroendocrine tumors of the Jung including all neuro- 
endocrine lung tumor types has been reported to date* 

The airh of this study was therefore to investigate the 
presence of ssl2 mRNA and protein in a series , of 26 
neuroendocrine tumors of the lung, employing different 
technical approaches, such as PT-PCR, JSH, and iminu- 
norjustochencustry. To this purpose a monoclonal anti- 

. body to sst2 (N- terminal) was generated in the Depart- 
ment of Pathology, University of Turin. The results were 
then compared and related to the rumor grade and to 
other clinic opathologic parameters. 

MATERIALS AND METHODS 

Case Series and RNA Extraction 

Twenty-six samples of neuroendocrine rumors of the 
lung, in which fresh frozen tissue was available, were 
retrieved from the surgical pathology file of the Univer- 
sity of Turin, Italy. All samples were reviewed applying 
currently accepted criteria of classification (4,40), and 
the neuroendocrine nature was confirmed by positive im- 
niunostairiing for chromogranin A (CgA) (with or with- 
out antigen retrieval) or synapiophysin, and by positive 
RT-PCR for CgA nxRNA. According to the classi- 
fications described here, these included 15 well- 
differentiated neuroendocrine tumors (typical carci- 
noids), 6 well- differentiated neuroendocrine carcinomas 
(atypical carcinoids), and 5 SCCs. 

A series of 20 non-small-cell lung carcinomas (10 
squamous, 9 adenocarcinomas, and 1 large- cell anaplas- 
tic) lacking neuroendocrine differentiation, as demonstrated 
by negative iirununoJiisTOcheniistrv* and RT-PCR for CsA 
(1), served as a control group. Climcopatbologic data and 
follow-up infonrration were obtained for all patients. 



For hybridization analysis, total RNA was extracted 
using the guanidine ubiocyanate-cesiurn chloride method 
(5). The concentration of RNA was estimated by spec- 
trophotometry, and RNA degradation was assessed by 
agarose gel electrophoresis, as previously reported (37). 

Reverse Transcriptase Polymerase Chain Reaction 
f or ss M and Chrornograniri A 

Total RNA (2 ixg) was first digested, with 10 units of 
RNase-free DNase (Boehringer, Mannheirn, Germany) 
in a 10-uX solution containing 20 rninol/L MgCJ 2f to 
avoid DNA contarjunation. The solution was kept at 
room temperature forTO toinutesrtben heated for 5 min- 
utes at 70 D C to inactivate the DNase molecules; 40 
pmol/L of bligodeoxythymidine primers (oligo-dT16) 
were added and the solution was heated again at 70°C for 
10 minutes, then chilled on ice to allow the primer hy- 
bridization. The resulting solution was reverse tran- 
scribed using 100 units of reverse transcriptase (Gibco 
BRL, Gaitersburg, MD). Complementary DNA (cDNA) 
was generated in a 50-jjlL final reaction volume contain- 
ing 50 mmol/L Tris-HG pH 83, 75 inrnol/L KO. 3 
rninol/L MgCl 2 > 10 mrxjol/L ditto othreitol, 1 rrtmol/L de- 
oxynucleotide triphosphates (dNTPs), and 20 units of 
RNasin (Proroega, Madison^ WJ). The solution was 
heated at 37°C for 90 minutes. Finally, the enzymes were 
inactivated by heating to 70 t, C for 10 minutes. 

The efficiency of the reverse transcription was deter- 
mined by performing a PGR reaction having the P 2 - 
microgJobulin "housekeeping gene" as a target. PCR was 
carried out in a 10-ftL final reaction volume containing 
1 pL of cDNA template, 10 pmol of sense and antisense 
oligonucleotide primers, 67 mmol/L Tris-HQ pH 8.8, 16 
mrnol/L (NH 4 )2S0 4 , 0.01% polysorbate 20, 2 mmol/L 
dNTPs, 1 mmol/L MgCl 2 , and 0.5 units of Taq polymer- 
ase. P 2 -Miaoglobulin, sst2, and CgA PCR reactions were 
performed using the same protocol ai the following PCR 
conditions: 35 cycles, each cycle consisting of denaturaiion 
at 94°C for 2 minutes, armealing at 55°C for 1 minute for 
^-microglobulin, at 61 D C for sst2, and at 68°C for CgA; 
extension was performed at 72°C for 1 minute. The primers 
used for RT-PCR (9,1 1 ,2337) are reported in Table 1 . 

The amplified fragments were run in a 1% agarose gel, 
containing .ethidium bromide. Strict precautions against 
contamination were undertaken (19) and negative con- 
trols (a no- template control and a no^reverse transcriptase 
control and distilled water to replace the RNA) were in- 
cluded. The RNA extracted horn an H716 neujoendocrine 
colon carcinoma cell line, and from a neuroblastoma (37) 
served as positive controls for CgA and sst2 T respectiveJy. 

Anribodies 

Two different .antibodies specific for sst 2 were em- 
ployed: The first one was a monoclonal antibody raised 
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Size of PCR 
product (bp) 



Position 



Study 



120 
2B4 
5B3 



286-305 
3B9-408 

665-6B4 
31&-335 
B78-e97 



Gussow et aL {9) 
Sesrjnretai.(37) 
HelrnanetaL (11) 



- PCR; polymerase chain reaction. 

in the Department of Pathology (University of Turin) 
snecificforjr^^ 



"^5Ta and B receptor isoforms). The octapeptide 
EpyYDLTS, corresponding to amino acids 35 to 42 of 
the human receptor (and differing by one amino acid 
jrom the mouse sequence), was synthesized, having a 
ly<dn added to tbe N-terminal. This sequence was similar 
toibal used by other groups to produce polyclonal anti- 
bodies (17,18,27). This .sequence was rather short but 
made it possible to avoid extensive homology with sstl. 
In addition, according to a genbank search using FASTA 
(28) this protein sequence is unique to human sst2 and 

- has a partial homology only with rat and human nuclear 
receptor retinoid orphan nuclear- receptor-beta (a protein 

" bavin* nuclear localization)- Three Balb/c mice were in> 
njuniz^d with the peptide conjugaled to keyhole limpets 
hemocyarrin (KLH) (Sigma, St. Louis, MO) following 

' the standard procedure. After the first intrasplemc injec- 
tion (100 ftg of protein) at time 0, the mice* were intra- 
peritoneally injected six times with the peptide-KLH 
conjugate (150 fig) in the presence of Freund adjuvant 
Tbe reactivity of the sera from each animal was evalu- 
ated using an enzyme-linked immunosorbent assay, us- 
ing tbe peptide coated onto the plastic. Tbe bybridomas 
were produced by somatic fusion of immunized spleno- 
cytes with the mouse myeloma cell line Ag8.X 63.653, 

.foDowing the standard technique (21). Tbe monoclonal 
antibodies of interest were selected on the basis of tbe 
reactivity with tbe target peptide and with appropriate . 
tissue sections. Tbe latter included formalin-fixed and 
paraffin-embedded sections of pituitary, gland and pan- 
aeatic islets and were analyzed by means of irnmuno- 
peroxidase staining. Parallel control experiments were 
also performed by staining serial sections of these tis- 
sues, emitting (be primary antibody ox with the preim- 
rrjune serum or with the antibody pjeadsorbed with high 
concentrations (1 rog/mL) of the antigenic peptide. In 
addition, the selected .monoclonal antibodies (coded 
10C6 and lOG4),.botb of JgM isotype, weie further char- 
acterized by Western blotting. Membranes were prepared 
from stable transfected Chinese hamster ovary (CHO> 
K) cells, individually expressing recombinant human so- 
matostatin receptors (sstl to sst5). Western blotting was 
performed as previously described (36). Tne monoclonal 



antibody was used as culture supernatant at 1:3 dilution 
_for 9'hnurs at room temperatirje.m Tris-buffered saline 
(TBS) supplemented with 0.1% poJysorbate 2CL Blots 
were washed in TTBS and incubated with peroxidase- 
conjugated goat antimouse IgM, diluted 1:1.500 for 90 
minutes at room temperature. Then, blots were washed ra 
TTBS and immunocomplexes were visualized using 
ECi followmg.manufacwrer's instructions (Amersbam, 

Bucks, UK). " ' v . . ■ 

A second polyclonal antibody was produced that bad 
been characterized previously (35,36). This antibody 
(coded K230) was raised in sbeep and was specific for a 
sequence of the C-terminal portion of tbe sst2A (KSRL- 
>TETICTQRTL1>^EDLQ, amino acids 347 to 366). 

Irnj^imobistochernistry 

Sections 4 or 5 tx thick, adjacent to those used for 
conventional histopathologic examination and imrxmno- 
staining for neuroendocrine markers, were coDected onto 
poly-L-lysine-coaied slides. The proliferative activity of 
the tumors was assessed by means of K567 imrnunosta- 
nining (clone MIBl, Immunotecb, Marseille, France), 
diluted 1:10 after microwave-based antigen retrieval in 
citrate buffer). Tbe ascitic fluid of monoclonal antibody 
10G4 was used in this study and was applied to tissue 
sections with prior antigen retrieval (three 3-mirmte pas- 
sages in a.microwave oven at 800 W in citrate buffer P H 
6.0), at the dilution of 1:10,000 or 1:12,000 for 30 rrm> 
utes' at room temperature, tbe antiserum coded K230- 
was applied overnight at a dilution of 1 :300 with no pnor 
antigen retrieval. The irnrnune reactions were then re- 
vealed with the immunoperoxidase technique (13) using 
the streptavidin-peroxidase kit and diaminobenzidme as 
chroroogen. A weak nuclear counteistain or no counter- 
stain was used in parallel sections. Control stainings for 
both antibodies included immunoperoxidase of serial 
, sections using preimraune serum or antibody pread- 
sorbed with the antigen or buffer instead of tbe primary 
antibody. 

In Sim Hybridization 

Selected tumors (12 samples) were also analyzed M 
ss\2 mRNA expression by means of a nonradioactive, 
ryi amide deposition- based 1SH technique. The procc 
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dure of amplification was modified from procedures re- 
ported by Kerstens et al (16), Speel er al. (38), and' the 
GenPoint (biotinyl-tynamide) manufacturer (Dako, Glos- 
trup, Denmark). Briefly, 5-jJun-tbick paraffin sections 
were collected onto silane-coated slides and deparaf- 
fmized through xylene and graded alcohols to phosphate 
buffer saline (?BS) : The slides were then incubated fox 5 
minutes in a microwave oven ai 800 W in citrate buffer 
pH 6.0. After washing in PBS, they weie digested with 
proteinase K(l n-g/mL) for 10 minutes at 23 °C Endog- 
enous peroxidase activiry was blocked with 3% hydro- 
gen peroxide and endogenous biorin was blocked using 
~^dnr- bloc king iea gent-for-35 minutes -foHo wed-by— 



washing in PBS and biotin-blocking reagent for 15 rnin- 
utes (3). Sections were then preby wndi2ed for 1 hour at 
room temperature in a mixture composed of 4x SSG, 
- 50% forroamide, Denbardt's 1 x, dextrane sulfate 5x, 500 
Hg/raL salmon sperm DNA, and 250 ^g/mL tRNA. Hy- 
bridization took place overnight at 42°C in a solution 
containing the specific probe at a concentration of 1 
pmol/mL The probe was a digoxigenin-labeled 48-base 
oligonucleotide (32), complementary to positions 91 to 
J 39 of the human sst2 gene (41). After hybridization, 
excess hybridization buffer and coverslips were removed 
by a rapid wash in 4x SSC followed by stringent washing 
in 0.1 x SSC for 10 minutes at 42 D C. The hybrids were 
jevealed by the following incubation steps: peroxidase- 
labeJed antidigoxigenin (diluted J:J00 in PBS) for 30 
minutes at room temperature, biotinylated tyr amide (di- 
luted J:5 in PBS) for 15 minutes at room temperature, 
and peroxidase-labeled streptavidin for 35 minutes at 
room temperature. Diaminobenzidine was used as chro- 
mogen. Conuols for 1SH included staining of serial sec- 
tions with sense probe, an unrelated probe (EBER-J of 
the Epstein-Barr virus), aud omission of the probe in the 
hybn'dization mixture, with all other experimental con- 
ditions identical to the procedure described here. 



RESULTS 

Reverse Transcriptase Polymerase Chain Reaction 

All neuroendocrine tumors, but no nonneuroendocrine 
lung carcinomas, were positive for CgA mRNA (Fig. 1). 
Sst2 mRNA was amplified in 22 of 26 samples of neu- 
roendocrine turnor. The signals had variable intensities 
(fig. 2) and were weak in moderately or poorly differ- 
entiated tumors (mostly in SCCs). No amplification was 
obtained in no-template or no-reverse transcriptase ex- 
periments. Control samples (nonneuroendocrine lung 
carcinomas proven by negative CgA RT-PCR) were 
weakly positive for sst2 i n 5 of 20 samples only (includ- 
ing 3 adenocarcinomas, J squamous, and the large-ceD 
anaplastic carcinoma) (Fig. 3). These differences were 
statistically significant (P < 0.01) by X 2 test 

Characterization of Monoclonal Antibodies to sstZ 

Several clones were identified having a positive bind- 
ing by enzyme-linked immunosorbent assay and a par- 
allel immunore activity on formalin- fixed paraffin- 
embedded human endocrine tissues (pituitary and pan- 
creatic islets). In Western blotting experiments, two 
clones (coded 10C6 and 10G4) specifically developed a 
band at approximately 70 kJX When the antibodies were 
used against CHO-transfected cells expressing recombi- 
nant somatostatin receptors 1 through 5, a specific band 
corresponding to sst2 (at approximately 70 kD) was re- 
vealed by the monoclonal antibody 10G4. Monoclonal 
antibody 10C6 developed a strong band with sst2 but- 
displayed a weaker reactivity also with sstl, 3 T and 5, al 
least in the present experimental conditions (Fig. 4 A3). 
The same antibodies were also tested by means of iin- 
rounoperoxidase staining, on formalin- fixed, paraffin- 
embedded samples of "normal human pituitary glsnd and 
pancreas. Monoclonal antibody 1QG4 gave good Jesuits 
in immunohistochernistry arid was used at increasing di- 




I 2 3 4 5 6 7 8 9 10 13 12 13 J4 15 16 37 J8 39 20 21 22 23 24 25 26 C C* M 

FIG. 1. Reverse transcriptase polymerase chain reaction lor chromograniri A (CGA) mRNA 
in 26 samples ot neuroendocrine tumor ot the lung. Numbers in each lane correspond to 
sample numbers in Table 2. CGA mRNA is amplified at 429 bp. C and C* stand tor positive 
(neuroendocrine colon carcinoma cell line, H716) and negative (distilled water) controls, 
respectively. The las! column to the right represents the molecular weight marker. Ail samples 
are positive with a variable intensity ot the amplification band.. 
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j 2 3 4 5 6 7 8 9 10 Jl 12 J3 14 15 16 17 1819 20 21 22 23 24 25 26 C C* 
FIG. 2. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 26 samples of 
neuroendocrine tumor of the lung. Numbers in each lane correspond to sample numbers in 
Table Z sst2 mRNA is amplified at 284 bp. C and C* stand for positive (a neuroblastoma) 
and negative (distilled water) controls, respectively. The last column to the right represents 
-the molecular weight ma rker. Twenty-two of 26 samples are positive with a va riable Intensity 
of the amplification band. 
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luDons(np to 1:15,000) with specific staining. Using thin 
sections (approximately 4 p.m), a strong membrane- 
bound and peripheral cytoplasmic immnnoreactivity was 
found in an adenohypophyseal cell population (corre- 
sponding to growth hormone- secreting cells, as con- 
firmed by double immunohistochemical analyses) and in 
pancreatic islets (Fig. 4 CJ>). In the latter, the staining 
was apparently not restricted to a specific hormone- 
producing cell type and had a peripheral cytoplasmic or 
membrane distribution. Exocrine pancreatic cells (both 
acinar and ductal) were only occasionally immuno- 
stained. lmmunohisiochemistry performed on serial con- 
trol sections, either omitting the primary antibody or us- 
ing the preimmune serum or antibodies preabsorbed with 
the synthetic peptide, was negative in both tissues. 
Monoclonal antibody 10C6 had a relaiively higher back- 
ground staining at similar dilutions. 



lmmunobistochemistry - 

The antibodies to ssi2 (monoclonal antibody )0G4 and 
polyclonal K230) gave slightly different imrnunoreac- 
tions in 25 samples, and staining was not done in 1 
sample because of lack of residual paraffin blocks. The 
monoclonal antibody 10G4 stained 21 of 25 samples, the 



uegarive samples being 1 atypical carcinoid and 3 SCCs 
(Fig. 5). The rumors had 5?o to 25% of the neoplastic 
cells imrmmoreactive. The ; staining was at the periphery 
of the cytoplasm, and omitting the counterstain its mem- 
brane-bound distribution was better outlined in most 
samples (Fig. 6). One sample of atypical carcinoid <no. 

21) was focally immunoreactive for sst2, despite nega- 
tive RT-PCR findings. Conversely, sample no. 26 was 
inununohistochernistry negative and RT-PCR positive. 
The antiserum anti-sst2A (code K230) gave positive sig- 
nal in 1 9 of 25 samples, in 5% to 60% of the neoplastic 
cell population (Fig. 7). The location of the staining was 
at the membrane level associated with a weak cytoplas- 
mic reactivity. The same pattern was seen in positive 
controls, e.g., pancreatic islets (Fig. 7, inset). Two 
samples (nos. 19 and 26) were negative in spite of a 
positive RT-PCR signal Two other tumors (nos. 21 and 

22) , apparently devoid of sst2 mRNA, showed a small 
percentage of immunoreactive cells. Incidentally, one of 
these latter samples (no. 21) was also immunoreactive 
with monoclonal antibody 10G4 (Table 2). 

The five control samples positive 'by RT-PCR were 
also reactive with the antibodies. The typ e °* immuno- 
cytocbemical location of ssi2 receptors was similar to 
that described here, being a peripheral cytoplasmic stain- 




M 1 23 4 5 6 7 8 9 10 Jl 32 J3 14 25 36 17 18 J9 20 C C* M 

r!G. 3. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 20 control 
samples 61 nonneuroendocrine lung carcinoma. Five ol 20 samples show a weak band at 
264 bp corresponding to sst2 mRNA. Control columns (C and C*) are identical to those in 

Fig. 2. 
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In Situ Hybridization 

Eight of 12 samples stained by 1SH were positive for 
sst2 mRNA. The mRNA was present in a percentage of 
cells (ranging from 10% to 40%) and gave a weak signal 
(Fig. 8), despite the amplification provided by the tyra- 
mide-based procedure. The background level was mini-, 
mal using diluted biotinylated tyramide. Control sections 
stained with sense probe or an unrelated probe, or omit- 
ting the probe, were consistently negative. 




Clinical Data 



FIG. 4. Western blot analysis of monoclonal antibody 
clones 10G4 and 10C6 against sst2 in Chinese hamster 
oyary cells transacted with recombinant sst 1 through 5 
(numbers of each column "correspond to receptor type). 
Monoclonal antibody 10G4 shows a specific band at ap- 
proximately 70 kD for sst2 only (A) as opposed to mono- 
clonal antibody 10C6, which strongry reacts with sst2 but 
also has some degrees of cross- reactivity with sst 1, 3, 
and 5 (B). The lower figures show control formalin-fixed 
paraffin-embedded pancreatic islets irnmunostained with 
monoclonal antibody 10G4 without (C) and with (D) 
preadsorption with the peptide antigen, respectively. The 
majority of endocrine cells show a membrane- bound im- 
muhoreactrviry (C) (immunop.eroxidase). Bar: 90 pm. 



ing present in 40% to 70% of neoplastic cells. A weak 
and focal staining was also observed in five of the re- 
maining RT-PCR-negative samples, when the antibody 
K230 was used (but not when the monoclonal was em- 
ployed). 

Several cells in per immoral tissues were occasionally 
stained Ciliated cells of bronchia! mucosa h3d a periph- 
eral staining at the cilia border. Mucous glands were 
negative. Rare chondrocytes had a membrane staining. 
The wall of periiumoral as well as of occasional distant 
vessels was srained a! the endothelium level and in oc- 
casional smooth muscle cells. 

. The jeaciiviry of both antibodies was abolished in se- 
rial sections when the reagents were preabsorbcd with 
the respective synthetic peptides, but not when an unre- 
lated peptide was used. The perirurooral bronchial mu- 
cosa had a focal staining of ciliated cells with both an- 
tibodies. This reactivity disappeared when the pre ab- 
sorbed ami body was apphed. 



Clinicopathologic data are summarized in Table 2. At 
follow-up, the majority of patients with typical carci- 
noids are free from disease 1 to 11 years after surgery. 
Two patients are alive with stable metastatic disease. 
Patients affected by atypical carcinoids had disease pro- 
gression in one third of samples. Finally, patients with SCC 
had fatal outcomes within 1 year from diagnosis (except the 
recent sample). Eight patients had preoperative octreotide 
scintigraphy performed at the time of diagnosis. All patients 
had positive octreoscan findings; *arjd, in these patients, 
also the tumor was positive by RT-PCR and immu- 




FIG. 5. sample no. 25 (small cell carcinoma). Absence of 
immunoreaclivity for sstJ? with the monoclonal 10G4. This 
sample was also negative by reverse transcriptase poly- 
merase chain reaction and in situ hybridization. (Immuno- 
peroxidase in a formalin fixed paraffin-embedded sample. 
Nuclei slightly counterstained with hemalum.) Bar: 45 pm. 
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^histochemistry or 1SR Id addition, three of these pa- 
tients received octreotide therapy administered at the 
time of rumor recurrence or metastatic spread. Stable 
disease is recorded at follow-up more than 5 years after 
diagnosis. 

Corrections 

Overall, complete overlapping (re., RT-PCR, 1SH, 
and immunobistochemistry with two antibodies) be- 
tween sst2 gene and protein expression was obtained in 
21 of 25 samples (84%) and between RT-PCR results 

^j^-Simmriomsiocbeiiiica] frndrngS-Nvith.-fltJeasl-oc 
the antibodies in 24 of 25 samples (96%). Tbe monoclo- 
nal antibody 10G4 looked highly sensitive, being able to 
stain all but one sample (do. 26) (95%) positive for sst2 
mRNA by RX-PCR. Sst2 expression, at mRNA as well 
as at protein levels, was reduced in high grade tumors, 

'' with SCCs being weakly positive in ODly two of five 
samples. Decreasing expression of sst2 appears to cor- 
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FIG. 6. sample no. 16 (typical carcinoid). Immunohislo- 
chemical detection of sst2 by means ol monoclonal anti- 
body 10G4, The neoplastic cells have a peripheral cyto- 
plasmic staining and membrane posilivity in some xells, 
whereas the peribronchial gland adjacent to the tumor is 
unreacffve. (Immunopercxidase in a formalin- fixed paraf- 
fin- embedded sample. Nuclei slightly counterstained with 
hemalum.) Bar: 45 pm. The membrane- bound distribution 
of tbe immunostaining is better outlined in a parallel sec- 
tion stained lor monoclonal antibody 10G4 omitting' 
nuclear counterstain (inset). 



FIG- 7. Same sample as in Fig. 6. tmmunohistocbemical 
detection ol sst2 by means of the polyclonal antibody 
K230. The immunostaining is more intense at the cell bor- 
der (arrows), as observed with the monoclonal antibody. 
In the inset,- a pancreatic islet, used as positive control, 
shws a predominant membrane-bound immunostaining of 
many neuroendocrine cells. (Immunoperoxfdase in a for- 
malin-fixed paraffin-embedded sample. Nuclei slightly 
counterstained with hemalum.) Ban 45 pm. 

relate with high tumor grade and elevated proliferative 
activity, but not with other parameters such sex, age, or 
tumor size. 

DISCUSSION 

In this study, the presence of sst2 mRNA has been 
demonstrated in a series of resected neuroendocrine ru- 
mors of the lung by. means of RT-PCR and confirmed by 
a sensitive nonradioactive tyramide-based ISH procedure 
and by immunohistochemistry with anti-sst2 antibodies. 
Samples of both carcinoid tumors and SCCs were sst2 
positive, although a reduced, or absent signal was ob- 
served in poorly differentiated (smalJrcel)) carcinomas. 
This is the first study of sst2 expression in a relatively 
laroe series' of neuroendocrine tumors of tbe lung. Single 
samples ol human carcinoids and SCCs (including cell 
lines of ihe latter) bad previously been analyzed and 
found to express sst2 (7 J 5^30,32,33,39,42). Several 
methods have been used to detect these receptors and 
partially overlapping result srwere obtained. 

In the present study, the expression of high amounts of 
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TABLE 2. Clinicopathoiogic data and somatostatin receptor type 2 (sst2) expression in 26 cases of neuroendocrine 

- lung tumors 
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Sex/age 


(cm) 


fmr\\ 
\TTK>) 


'no 


RT-PCR 


IHC 


IHC* 


RT-PCR 


Mab(10G4) 


(K230 Ab) 


1 


\Ktt~\ MCTT 


r/oo 




NED 90 






4- F 


1.5 


4-4-4 


+ 


+ 


2 . 


WD NtT 




** * 






44 


+ f 


3 


44 
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+ 


6 


WD NET 
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23 
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+ 
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4 F 


7 
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JF/69 


-3.5 


NED 47 


+ 


44 


4 


1 


4 




-I. 


8 


WD NET 
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3 
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+ 
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4- , 


-1 K. 
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+4-4 


+ 
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4 
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-f 


NT 


4-44- 


• + 
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10 
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AWD 55 
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1.1 


" 4-4 


4 r 


4 • 


11 
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F/29 


2 


A WD 56 


+ 
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+ 


2.5 


4-4+ 
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12 
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3 
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3 
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4 F 


19 
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5 
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4 


3 


++ 


4 




20 
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6 
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4 ■ 
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21 
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6 
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4 


44 


4 
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4 


23 
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4 
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44 
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M/68 
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4 


4 


44 
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AWD, alive with disease; CgA, chromogranin A; DOD, died of disease; F, focal: positive in <5% of cete; JHC; Immunohistochemistry; 
Mab, monoclonal antibody; NECa, Neuroendocrine carcinoma; NED, no evidence of disease; NET, neuroendocrine tumor; NT, not tested; 
R>PCR, reverse transcriptase polymerase chain reaction; SCC, Smat^cefl lung carcinoma; SYP, synaptophysin; Wp, weD differentiated. 
. " Ki67 IHC: values correspond lo percentage of positive nuclei of neoplastic cells. 

t Parents who had preoperative octreoscan performed. 

$ Patients who had octreoscan performed and octreotide treatment. 



sst2 mRNA was confirmed in well to moderately differ- 
entiated neuroendocrine tumors, in agreement with the 
results obtained by Reubi et al (32) by means of radio- 
active 1SH. The presence of sst2 mRNA in SCC had 
never been reported in human specimens, except for two 
samples included in. Reubi et al.'s series (32). Although 
the data on cell lines support the observation that SCCs 
contain sst2 (42), slightly discrepant results were found 
in some of samples, described here. Unfortunately, SCCs 
are rarely operated on, and therefore it is difficult to 
collect a large number of surgical specimens. The Jive 
samples studied in the current series by means of RT- 
PCR had a low amount (two samples) or absent (three 
samples) sst2 mRNA. This could be the result of the 
extensive necrosis commonly present in such rumor 
types. However, because care was taken to freeze frag- 
ments that weje.macroscopically devoid of necrotic ar- 
eas, a more likely hypothesis is that sst2 expression is 
reduced in poorly differentiated tumors. Recently, Reis- 
inger et aL (29) showed that the uptake of somatostatin 
analogues in patients with SCC undergoing chemo- 
therapy is significantly lower, and therapeutic externa) 
factors - may aJfect the receptor status of individual ru- 
mors. In addition, the uptake of somatostatin analogues 



in metastatic deposits of SCC has been shown to be low 
or absent (29,2). The present fundings suggest that the 
sst2 mRNA content is related to the degree of rumor 
differentiation. These data roust be confirmed in larger 
series of normeurbeDdocrine tumors to ascertain whether 
the observed loss or decrease of sst2 expression in neu- 
roendocrine rumors is a common, event linked to neo- 
plastic dedifferentianon. In addition, further studies are 
needed to assess the functionality of such receptors, by 
comparing the profile of sst2 expression in tumor tissues 
. with binding assays employing labeled somatostatin and 
with the clinical response to diagnostic and therapeutic 
administration of somatostatin analogues. 

To this purpose, several investigators have demon- 
strated a correlation between clinical imaging or re- 
sponse to somatostatin analogue treatment, and sst2 
mRNA content in single . samples of carcinoid rumors 
(15,22). Northern blotting and ISH weie the techniques 
used for sst2 mRNA identification. This kind of corre- 
lation is useful for selecting patients for somatostatin 
analogue treatment, although the demonstration of recep- . 
tor mRNA in a cell does not imply per se that the recep- 
tor is fully functional. 

The present study relied on a highly sensitive tech- 
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FIG. 8» sample no.- -14 (typical carcinoid). In : sttu^hybrid-~ 
ization (ISH) toe sst2 mRNA shows a weak cytoplasmatlc 
staining (A) in most tumor cells.. An ISH performed with an 
unrelated probe was negative in a serial section of the 
same tumor (B). This , sample was strongly positive by 
reverse transcriptase polymerase chain reaction for sst2 
mRNA and by immunohistochemistry. (Nonradioactive 
ISH revealed by peroxidase and diaminoben2idine, as 
substrate. Nuclei counterstained with hemalum.) Bar; 
75 pm. " . 

nique, RT-PCR, to identify all sampks bearing even 
small amounts of sst2 mRNA. Indeed, in a previous 
study, single samples exhibiting ocrjeotide-binding sites 
had no demonstrable sst2 mRNA by means of ISH, pos- 
sibly due to the low sensitivity of the ISH procedure (34). 
The RT-PCR bas sbown sst2 mRNA transcripts in tbe 
majority of samples here studied. Only four samples 
were negative, all belonging to poorly differentiated high 
grade tumors, which usually follow an aggressive course. 
A decrease of sst2 mRNA expression in association witb 
neuroendocrine tumor dedifferentiatJon had also been re- 
ported in neuroblastomas (37). In tbe above report, as 
well as in tbe current study, samples having an unfavor- 
able prognosis were found to contain a relatively low 
amount of sst2 mRNA, as compared witb weil- 
differentiaied tumors. 

In the current sample series, eight samples were in- 
vestigated before surgery with radiolabeled octreotide. 
Despite the low figures, all tbe samples positive at tbe 
diagnostic procedure had a strong RT-PCR signal for 
sst2 mRNA. Three of eight patients were also responsive 
to octreotide treauneni administered at tbe time of re- 
lapse or metastatic spread. More extensive correlative 
clinicopaihologic smdies on the sst status are needed to 
better define the tissue distribution, of somatostatin bind- 
ing sites and their potential clinical role in the Ueatment 
of patients. 

Sst2 evaluation .by means of ISH (J 4,32) or RT-PCR 
(26,37) is a highly sensitive and reliable procedure. Un- 
fortunately, these techniques have limiiations because 
hozen tissue is needed for some of them, and radioactive - 
Material or costly and lime- consuming medrods aie nec- 



essary for others. Immujiobistocberrjical analysis of sst2 
by means of specific antibodies represents an ideal, 
cheap, and rapid alternative, easUy applicable to archival 
material. For these reasons, several investigators have 
. raised polyclonal antibodies specific for sst (8,10,15,17, 
IS). In tbe current study, tested tumor fragments adjacent 
to those snap frozen for RT-PCR analysis were tested 
with a polyclonal antibody against a C-termina) portion 
of the sst2A splice variant (35,36). In addition,^ mono, 
clonal antibody was produced in the Department of Pa- 
thology (University of Turin) against an N-terminal se- 
quence of tbe human sst2. This antibody was tbe first 
monoclo n al devel oped against sst2 and was.sbownto be 
highly specific for sst2 in Western blot and iinrxjunobis- 
tocbemical analysis. Both the monoclonal and tbe poly- 
clonal antibodies specifically reacted with all. samples 
also positive by RT-PCR (with minor discrepancies in 
two samples, likely due to tumor heterogeneity). Tbe 
observed correlation between RT-PCR and iinmunohis r • 
tocbemistry indicates that tbe latter may be a rebable 
diagnostic tool and may allow unmunobistocbernical in- 
vestigation for sst2 even in small biopsy samples. This in 
turn may enable a rapid screening of sst2-positjve njmors 
for medical treatment with somatostatin analogues. 

Having confirmed in a relatively large series that tbe 
vast majority of. neuroendocrine tumors of the lung con- 
tain variable amounts of sst2 mRNA, a final comment is 
deserved for sst2 expression in nonneuroendocrine lung 
carcinomas. No data have been reported thus far in tbe 
literature concerning normal human lung, although in the 
present study some bronchial cells of perirumoral paren- 
chyma were positive for sst2 when immunohistocberni- 
cal analysis was performed with either antibody. Tbe 
staining was specific because it was abolished usiug pre- 
absorbed antibodies. Therefore, it is likely that normal 
human lung tissue contains sst2. This might be con- 
firmed by alternative techniques (e.g., Western blot, RT- 
PCR). However, in situ morphologic procedures, such as 
those employed here, have definite advantages. In fact, 
the Jung is rich in vessels, and in several tissues (either iri- 
tumoral or in mflanvjoiatory-reactive conditions) tbe ves- 
sels were rec endy shown to contain sst (6). 

A low expression of sst2 was found in 25% of lung 
caiciuomas of nonneuroendocrine type investigated in 
the present study by means of RT-PCR. Therefore, sst 
type 2, at least, does not appear to be extensively ex- 
pressed in nonneuroendocrine carcinomas of the lung. 
However, because two tumors in the control group (a 
squamous carcinoma and an adenocarcinoma, respec- 
tively) had positive ocueotide scintigraphy, but no sst2 
mRNA, it is plausible that a heterogeneous distribution 
of sst occurs in nonneuroendocrine lung rumors. Other 
receptor types may be expressed in these rumors arid may 
be responsible for die positive jesuJts in diagnostic test- 
ing. Because ssto. is also kjiown to bind somatostatin 
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analogues, such as octreotide, with high affinity (24), the 
expression of this receptor type will be investigated in 
forure studies. O 
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Expression of somatostatin receptor types 1-5 in 81 cases of 
gastrointestinal and pancreatic endocrine tumors. A correlative 
immunolrisiocheniical and revcrse-transcriptase polymerase chain 
reaction analysis. 

Papotti M, Bongiovanni M Volante M . AHia £ , Landolfi S . Helboe L , Schindler M 
Cole SL , Bussolari G . 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
10126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human tumors by 
means of autoradiography, revcrse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and immunohistochemistry (IHC). We analyzed the SSTR type 
1-5 expression by means of RT-PCR and/or IHC in a series of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine tumors and related normal tissues. 
Moreover, we compared the results with clinical, pathological and hormonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies to SSTR types 2, 3, 5 and ISH for SSTR2 mRNA. 
The vast majority of tumors expressed SSTR types 1, 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to the good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, tfurty-five additional GEP endocrine tumors were studied with 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTR5 and 100% gastrinomas and glucagonomas 
expressed SSTR2. Pre-operative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work-up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2, 3 and 5 in surgical and biopsy specimens. 
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P-cadherm overexpression is an indicator of clinical outcome in invasive 
breast carcinomas and is associated with CDH3 promoter 
uyponictliylation. 

Paredes J , Albergaria A, Olivcira JT . Jcronimo C . Milanczi F . Schmitt PC . 

Institute of Pathology and Molecular Immunology of Porto University (IPATrMUP) 
Braga, Portugal, jparedes@ipatimup.pt 

PURPOSE: P-cadherin overexpression has been reported in breast carcinomas where it 
was associated with proliferative high-grade histological tumors. This study aimed to 
analyze P-cadhenn expression in invasive breast cancer and to correlate it with tumor 
markers, pathologic features, and patient survival. Another purpose was to evaluate the P- 
cadhenn promoter methylation pattern as the molecular mechanism underlying this gene 
regulation. EXPERIMENTAL DESIGN: Using.a series of invasive breast carcinomas P- 
cadlienn expression was evaluated and correlated with histologic grade estrogen 
receptor, MIB-1, and p53 and c-erbB-2 expression. In order to assess whether P-cadherin 
expression was associated with changes in CDH3 promoter methylation, we studied the 
methylation status of a gene S'-flanking region in these same carcinomas. This analysis 
was also done for normal tissue and for a breast cancer cell line treated with a 
dernethylating agent. RESULTS: P-cadherin expression showed a strong correlation with 
nigh histologic grade, increased proliferation, c-erbB-2 and p53 expression lack of 
estrogen receptor, and poor patient survival. This overexpression can be regulated by 
gene promoter methylation because the 5-Aza-2'-deoxycytidine treatment of MCF-7/AZ 
cells increased P-cadherin mRNA and protein levels. Additionally, we found that 71% of 
I -cadhenn-negative cases showed promoter methylation, whereas 65% of positive ones 
were unmethylated (P = (XOOS). The no*mal P-cadherin-negative breast epithelial cells 
showed consistent CDH3 promoter methylation. CONCLUSIONS: P-cadherin expression 
was strongly associated with tumor aggressiveness, being a good indicator of clinical 
outcome. Moreover, the aberrant expression of P-cadherin in breast cancer might be 
regulated by gene promoter hypomethylation. 



PMID: 1611 5928 [PubMed - in process] 



486: J Dairy Sci. 1992 Jun;7S(6): 1 423-9. Related Articles. Links 



Ma miliary -derived growth inhibitor protein and messenger ribonucleic 
acid concentrations in different physiological states of the gland. 

PoHtts I , Gorcvvtt RC , Mutler T, Grossc R . 

Department of Animal Science, Cornell University, Ithaca 14853. 

Expression of mammary-derived growth inhibitor in tissue from lactating and involuting 
bovine mammary glands was investigated- Seventeen lactating, pregnant (220 to 272 d in 
gestation) cows were divided in two groups of 8 and 9 cows each. Cows of the first group 
were slaughtered while in lactation. Cows of the second group (9 involuting cows) were 
slaughtered at 13 to 52 d following sudden cessation of milking. High concentrations of 
mammary-derived growth inhibitor (.63% of the total protein) were detected in mammary 
tissue of lactating cows. Manimary-derived growth inliibitor (less than .10% of the total 
protein) was dramatically reduced during most of the involution period (13 to 45 d 
following cessation of milking). Mammary-derived growth inhibitor was again detected 
(.28% of the total protein) during the last stage of the involution (46 to 53 d after 
cessation of milking), which coincided with colostrum formation. When steady state 
concentrations of mammary-derived growth inliibitor mRNA were examined, the results 
obtained mirrored those obtained at the protein concentration. These data suggest that 
regulation of mammary-derived growth inhibitor occurs via modulation of the steady 
state concentration of its rriRNA. Furthermore, there is a strong correlation between 
mammary-derived growth inliibitor expression and lactation in dairy cows. 
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T-cell receptor V beta-family usage in primaiy cutaneous and primary 
nodal T-cell non-Hodgkin's lymphomas. 

PrcesmanAH , Hu HZ, Tilanus MG. degcus B, Schunrman HX Reitsma R . van 
Wiclien DF . van Vloten WA de WegerRA . ~ ~ " 

Department of Pathology, University Hospital Utrecht, The Netherlands. 

To evaluate whether the expression of T-cell receptor (TCR) V beta families in eight 
cases of malignant T-cell lymphomas took place in a preferential manner, we analyzed 
four cases of mycosis fungoides (MF), the most common form of primary cutaneous T- . 
cell non-Hodgkin's lymphomas (NHL), and four cases of primary nodal T-cell NHL. The 
usage of V beta families in T-cell populations was investigated on mRNA that was 
transcribed to cDNA using a C beta primer and reverse transcriptase. Subsequently the 
specific usage of the families was analyzed by polymerase chain reaction (PCR) using 
combinations of the selected C beta-oligonucleotide primer and one of the family-specific 
V beta primers. Peripheral blood lymphocytes from four healthy volunteers and 1 
■reactive" lymph node served as a control and expressed all 20 V beta families tested for 
In T-cell lines, with restricted V beta expression, and in three patients with advanced MF 
only one or two V beta families were expressed at the mRNA level. In an early MF lesion 
this monoclonal expression was absent: several V beta families were expressed with a 
weak intensity. This may indicate either a polyclonal origin of MF, or that too few 
monoclonal neoplastic cells were present in the tissue specimen. In the four nodal T-cell 
NHL, only one family could be clearly distinguished, whereas some of the other V beta 
families showed only a weak expression. These latter families represent the reactive T- 
cell component in (he nodal T-cell NHL. Both in nodal T-cell NHL and in MF there was 
no preferential expression of a particular V beta family. There was a good correlation 
between PCR data and the expression of V beta-family protein products observed by 
immunohistocliemisuy on tissue sections of the T-cell lymphomas. All T-cell lines three 
cases of MF, and three cases of nodal T-cell NHL showed a rearrangement of the TCR 
beta chain on DNA level. 
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Matrilin-3 iu human articular cartilage: increased expression in 
osteoarthritis. 

P»tUg <X Weseloh G , Klatt AR . Wagener R, Swoboda B . 

Division of Orthopaedic Rheumatology, Department of Orthopaedics, University of 
Erlangen-Nuremberg, Rathsberger Str. 57, D-91054 Erlangen, Germany. 
oliver.pullig@med.ufii-erlangen.de 

OBJECTIVE: Matrilin-3 is a member of the recently described matrilin family of 
extracellular matrix proteins containing von Willebrand factor A-like domains. The 
matrilin-3 subunit can form homo-tetramers as well as hetero-oligomers together with 
subunits of matrilin- 1 (cartilage matrix protein). It has a restricted tissue distribution and 
is strongly expressed in growing skeletal tissues.. Detailed information on expression and 
distribution of extracellular matrix proteins is important to understand cartilage function 
. in health and in disease like osteoarthritis (OA). METHODS: Normal and osteoarthritic 
cartilage were systematically analysed for matrilin-3 expression, using 
irnmunohistochemistry, Western blot analysis, in situ hybrid izatibn, and quantitative 
PCR. RESULTS: Our results indicate that matrilin-3 is a mandatory component of 
mature articular cartilage with its expression being restricted to chondrocytes from the 
tangential zone and the upper middle cartilage zone. Osteoarthritic cartilage samples with 
only moderate morphological osteoarthritic degenerations have elevated levels of 
matrilin-3 mRNA. In parallel, we found an increased deposition of matrilin-3 protein in 
the cartilage matrix. Matrilin-3 staining was diffusely distributed in the cartilage matrix, 
widi no cellular staining being detectable. In cartilage samples with minor osteoartliritic 
lesions, matrilin-3 deposition was restricted to the middle zone and to the upper deep 
zone. A strong correlation was found between enhanced matrilin-3 gene and protein 
expression and the extent of tissue damage. Sections with severe osteoarthritic 
degeneration showed the highest amount of matrilin-3 mRNA, strong signals in in situ 
hybridization, and prominent protein deposition in the middle and deep cartilage zone. 
CONCLUSION: We conclude that matrilin-3 is an integral component of human articular 
cartilage matrix and that the enhanced expression of matrilin-3 in OA may be a cellular 
response to the modified microenvironment in the disease. Copyright 2002 OsteoArthritis 
Research Society International. 



PMID: 1 1950247 [PubMed - indexed for MEDLINE] 



222: B iochem Phar macol. 2000 Dec l;60(l I): 1 639-46 Related Articles, Links 

{ELS EV't E Hi 

Up-regulation of mitochondrial peripheral benzodiazepine receptor 
expression by tumor necrosis factor alpha in testicular leydig cells. 
Possible involvement in cell survival. 

R C Y C , MaaduitC . Naureils Q . Benahmcd M Louisot P , Gasnier F . 

INSERM U. 189, Faculte de Medecine Lyon-Sud, BP12, 69921 cedex, Oullins, France. 

Porcine Leydig cells in primary cultures are resistant to tumor necrosis factor alpha 
(TNFalpha) cytotoxicity. Here we report that these cells can be rendered sensitive to 
TNFalpha killing by treatment with the translation^ inhibitor cycloheximide, suggesting 
the existence of proteins that can suppress the death stimulus induced by the cytokine. In 
search of these cytoprotecti ve proteins, we focused on the constituents of the 
mitochondria] permeability transition pore (PT pore), whose opening has been shown to 
play a critical role in the TNFalpha-mediated death pathway. We found that TNFalpha 
up-regulated mRNA and protein expression of the mitochondrial peripheral 
benzodiazepine receptor (PBR), an outer membrane-derived constituent of the pore. A 
strong correlation was established between the resistance of the cells to TNFalpha killing 
and the density of PBR-binding sites. Concomitantly, TNFalpha down-regulated Bcl-2 
mRNA and protein expression. As Bcl-2 has been shown to be an endogenous inhibitor 
of the PT pore, we hypothesize that the TNFalpha-induced up-regulation of PBR 
expression may compensate for the decrease in Bcl-2 levels to prevent the opening of the 
PT pore. 
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GLUTl messenger RNA and protein induction relates to the malignant 
transformation of cervical cancer. 

Rudlowski C, Becker A J . Schroder W . Rath W . Buttncr R , Moser M . 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
69 11 5 Heidelberg, Germany. 

We studied whether induction of glucose transporters (GLUTs) I to 4 correlates with 
human papillomavirus (HPV)-dependent malignant transformation of cervical epithelium 
Tissue samples of cervical intraepithelial neoplasia (CIN; grades 1 to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing performed 
Tissue sections were immunoscained with GLUTl to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUT 1 protein expression. Weak 
expression of GLUTl was found in nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions correlating with 
the degree of dysplasia. In CIN 3 high-risk HPV lesions, cervical cancer, and metastasis 
GLUTl was expressed at highest levels with a strong correlation of GLUTl mRNA and 
protein expression. Immunostains for GLUT2 to GLUT4 were negative. Cervical tumor 
cells respond to enhanced glucose utilization by up-regulation of GLUTl The strong 
induction of GLUTl mRNA and protein in HPV-positive CIN 3 lesions suggests GLUTl 
overexpression as an early event in cervical neoplasia. GLUTl is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 
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Expression aad distribution of I am in in alphal and alpha2 chains in 
embryonic and adult mouse tissues: an immunochemical approach. 

S^fcLL GUta^J*, TaUsJU, Tcm|d_R, TaltsJF. 

Max-Planck-fcistitute for Biochemistry, Martinsried, D-82I52, Germany. 

Protein levels, mRNA expression, and localization of laminin alphal and alpha2 chains in 
development and in adult mice Were examined. Recombinant fragments were used to 
obtain high-titer-specific polyclonal antibodies for establishing quantitative 
radioimmuno-inhibition assays. This often demonstrated an abundance of alpha2 chain, 
but also distinct amounts of alpha I chain for adult tissues. The highest amounts of alphal 
were found in placenta, kidney, testis, and liver and exceeded those of alpha2. All other 
tissue extracts showed a higher content of alpha2, which was particularly high in heart 
and muscle when compared to alphal. Content of gamma! chain, shared by most 
laminins, was also analyzed. This demonstrated gamma I chain levels being equal to or 
. moderately exceeding the sum of alphal and alpha2 chains^ indicating that these isoforms 
represent the major known laminin isoforms inmost adult mouse tissues so far examined. 
Moreover, we found good correlation between radioirnmuno-inhibition data and mRNA 
levels of adult tissues as measured by quantitative real-time reverse transcriptase-PCR. 
Embryonic tissues were also analyzed by radioirnmuno-inhibition assays. This 
demonstrated for day 1 1 embryos comparable amounts of alphal and gammal and a 
more than 25-fold lower content of alpha2. This content increased to about 10% of 
alphal in day 13 embryos. The day 18 embryo showed in heart, kidney, and liver, but not 
yet in brain and lung, alpha l/alpha2 chain ratios comparable to those in adult tissues. 
Immunostaining demonstrated alphal in Reichert's membrane (day 7.5), while alpha2 
could not be detected before day 1 1 . 5. These data were compared with 
immunohistochemical localization results on several more embryonic and adult tissue 
sections. Our results regarding localization are consistent with those of earlier work with 
some notable exceptions. This was in part due to epitope masking for monoclonal 
antibodies commonly used in previous studies in esophagus, intestine, stomach, liver, 
kidney, and spleen. 
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Discordant regulation of granzynie H and granzyme B expression in 
human lymphocytes. 

Sedelies KA % Sayers TJ, Edwards KM , Chen W, Pcllicci DG / Godfrey PI Trapaai 
JA 

Cancer Immunology Laboratory, Peter MacCalium Cancer Centre, Locked Bag 1 , 
A'Beckett Street, East Melbourne, 8006, Australia. " . . 

We analyzed the expression of granzyme H in human blood leukocytes, using a novel 
monoclonal antibody raised against recombinant granzyme H. 33-kDa granzyme H was 
easily detected in unfractionated peripheral blood mononuclear cells, due to its high 
constitutive expression in CD3(-)CD56(+) natural killer (NFC) cells, whereas granzyme B 
was less abundant The NK lymphoma cell lines, YT and Lopez, also expressed high 
granzyme H levels. Unstimulated CD4(4-) and particularly CD8(+) T cells expressed far 
lower levels of granzyme H than NBC cells, and various agents that classically induce T 
cell activation, proliferation, and enhanced granzyme B expression failed to induce 
granzyme H expression in T cells. Also, granzyme H was not detected in NK T cells, 
monocytes, or neutrophils. There was a good correlation between mRNA and protein 
expression in cells that synthesize both granzymes B and H, suggesting that gzmH gene 
transcription is regulated similarly to gzmB. Overall, our data indicate that although the 
gzmB and gzmH genes are tighdy linked, expression of the proteins is quite discordant in 
T and NK cells. The finding that granzyme H is frequently more abundant than granzyme 
B in NK cells is consistent with a role for granzyme H in complementing the pro- 
apoptotic function of granzyme B in human NK cells. 
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BCL2 protein expression parallels its mRNA level in normal and 
malignant B cells. 

Shen \\ Iqbai J , Huang JZ , Zhou G , Chan WC. 

Department of Pathology and Microbiology, University of Nebraska Medical Center, 
Omaha, USA. 

The regulation of B-cell lymphoma 2 (BCL2) protein expression in germinal center (GC) 
B cells has been controversial. Previous reports have indicated posttranscriptional 
regulation plays a dominant role. However, a number of recent studies contradicted these 
reports. Using real-time polymerase chain reaction (PCR) and Standardized Reverse 
Transcriptase-PCR (StaRT-PCR), we measured the level of mRNA expression in GC, 
mantle zone (MNZ), and marginal zone (MGZ) cells from laser capture microdissection. 
Both quantitative RT-PCR measurements of microdissected GC cells from tonsils 
showed that GC cells had low expression of BCL2 transcripts commensurate with the 
low protein expression level. These results are in agreement with microarray studies on 
fluorescence-activated cell sorter (FACS)-sorted cells and microdissected GC cells. We 
also examined BCL2 mRNA and protein expression on a series of 30 cases of diffuse 
large B-cell lymphoma (DLBCL) and found, in general, a good correlation. The results 
suggested that BCL2 protein expression is regulated at the transcriptional level in normal 
B cells and in the neoplastic cells in most B-cell lymphoproliferative disorders. 
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Quantitative determinations of the steady state transcript levels of 
hexokinase isozymes and glucose transporter isoforms in normal rat 
tissues and the malignant tumor cell line AH130. 

Shinohara Y, Yamamoto IC InooJC, Yamazaki N. Terada HL 

Faculty of Pharmaceutical Sciences, University of Tokushima, Japan. 
yasuo@ph;tokushitna-u.acjp 

The steady state transcript levels of the four hexokinase (HK) isozymes and four glucose 
transporter (GLUT) isoforms were determined quantitatively by Northern analysis of 
RNA samples from rat tissues using synthetic fragments of the RNAs encoding the HK 
isozymes and GLUT isoforms. Results showed that the levels of HK. isozyme transcripts 
were low in rat tissues, the level of that most highly expressed, the type I isozyme (HKI), 
m the brain being 0.025% of the total po!y(A)+ RNA. A good correlation was found 
between the reported HK activities and the total amounts o f transcripts encoding all HK 
isozymes in various tissues, showing that the HK activities in tissues can be estimated 
from the total amount of transcripts encoding HK isozymes. The proposed associated 
expressions of HK isozymes and GLUT isoforms in particular tissues were confirmed at 
their transcript levels. The steady state transcript levels of type II HK and the type I 
GLUT isoform in the malignant tumor cell line AH130 were also determined 
quantitatively. 
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UVA irradiation-induced activation of activator protein- 1 is correlated 
with induced expression of AP-l family members in the human 
keratiuocyte cell line HaCaT. 

Silvers AL. Bowdcn GT . 

Department of Radiation Oncology, Arizona Cancer Center, The University of Arizona, 
• Tucson 85724, US A. 

To determine whether the transcription factor activator protein- 1 (AP-l) could be 
modulated by ultraviolet A (UVA) exposure, we examined AP-l DNA-binding activity 
and transactivation after exposure to UVA in the human immortalized keratinocyte cell , 
line HaCaT. Maxima! AP-l transactivation was observed with 250 kJ/m2 UVA between 
3 and 4 h after irradiation. DNA binding of AP-l to the target 12-0- 
tetradecanoyiphorbol-l 3 -acetate response element sequence was maximally induced 1-3 
h after irradiation. Both de novo transcription and translation contributed to the UVA- 
induced AP-i DNA binding. c-Fos was implicated as a primary component of the AP-l 
DNA-binding complex. Other components of the complex included Fra-2, c Jim, JunB 
and JunD. UVA irradiation induced protein expression of c-Fos, c-Jun, Fra-1 and Fra-2. 
Phosphorylated forms of these induced proteins were determined at specific time points. 
A strong correlation existed between UVA-induced AP-l activity and accumulation of c- 
Fos, c-Jun and Fra-1 proteins. UVA irradiation also induced c-fos and c-jun mRNA 
expression and transcriptional activation of the c-fos gene promoter. These results 
demonstrate that UVA irradiation activates AP-l and that c-fos induction may play a 
critical role in the response of these human keratinocytes to UVA irradiation. 
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Rat kidney glutamyl amiaopeptidase (aminopeptidase A): molecular 
identity and cellular localization. 

SongJL, YcM, Trovanovskay a IVf Wilk E . WUkS, Hcaly DP . 

Department of Pharmacology, Mount Sinai School of Medicine, City University of New 
York, New York 10029. . 

Glutamyl aminopeptidase [aminopeptidase A (EAP), EC 3.4. 1 1.7] is ah ectoenzyme that 
selectively hydrolyzes acidic amino acid residues from thcamino terminus of 
oligopeptides. EAP activity is highest within the kidney and small intestine. The murine 
pre-B cell BP-1/6C3 and the human kidney glycoprotein gpl60 differentiation antigens 
have been reported to have biochemical properties indistinguishable from EAP ft is not 
known, however, if rat kidney EAP is a homologueof these antigens or molecularly 
distinct. Using the reverse transcription-polymerase chain reaction method with 
oligonucleotide primers based on the BP-1/6C3 nucleotide sequence, we isolated a 450- 
bp partial cDNA from rat kidney poly(A)^ RNA. The partial cDNA encoded a predicted 
protein that was 92% and 86% identical to the murine BP-1/6C3 and human gp 160 
antigens, respectively; the amino acid sequence within the zinc-binding domain was 
completely conserved. Purification of EAP from rat kidney and microsequence analysis 
of a tryptic digest peptide fragment (18-mer) indicated that the fragment was highly 
similar to a region within the BP-1/6C3 and gpl60 proteins. Northern blot hybridization 
and immunoblot analyses were also consistent with labeling of products the same size as 
reported for the BP-1/6C3 and gpl60 antigens. There was a good correlation between the 
cellular distribution of EAP mRNA and EAP immunoreactivity, with proximal tubules 
and glomerular mesangial cells having the highest densities. These results indicate that 
rat kidney EAP is a species homologue of the murine BP-1/6C3 and human gp!60 
antigens. Furthermore, on the basis of its cellular localization, rat kidney EAP is likely to 
be involved m degradation of oligopeptides within the glomerulus and the glomerular 
filtrate. Since cells that express EAP also express receptors for angiotensin II, an 
mtrarenal vasoactive hormone that is a substrate for EAP, these results further suggest 
that EAP may play a role in modulating the activity of intrarenal angiotensin II. 
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Tumor necrosis factor-alpha uprcgulatcs the prostaglandin E2 EPl 
receptor subtype and the cycIooxygcnasc-2 isoform in cultured amnion 
WISH cells: 

Spaziani EP , Ccnoit RR , Tsibris JC , Gould SE . O'Brien 'WF . 

University of South Florida Health Science Center, Department of Obstetrics 8c 
Gynecology, Tampa 33612, USA. espazian@cbml.med.usf.edu 

Recent studies have demonstrated a strong correlation between infection and preterm 
labor. Preterm delivery is also associated with high levels of cytokines and prostaglandins 
in amniotic fluid. The purpose of this study was to investigate the effect of tumor necrosis 
factor-alpha (TNF-alpha) on the levels of cyclooxygenase, prostaglandin E2 production 
(PGE2), and expression of the PGE2 receptor subtype EP 1 in amnion WISH cell culture. 
Amnion WISH cell cultures were incubated in increasing concentrations of TNF-alpha 
(0-50 ng/ml). Changes in cyclooxygenase and EP I receptor proteins were evaluated by 
Western blot analysis. Changes in EPl mRNA were evaluated by Northern blot, and 
culture fluid concentrations of PGE2 were estimated by enzyme immunoassay (ElA). 
-EPl protein (p<0.01), EPl mRNA (p<0.05), cyclooxygenase-2 (COX-2) protein 
(p<0.00 1 ), and PGE2 concentrations (p<0.0 1 ) all increased with increasing 
concentrations of TNF-alpha. Changes in COX-1 protein were not observed following 
TNF-alpha-incubation. The results. suggest that TNF-alpha may play a role in infection- 
induced preterm labor by its pleiotropic ability to simultaneously stimulate COX-2 
activity, PGE2 concentrations, and PGE2 EPl receptor levels in human amnion. 
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Transcriptional activity of potent glucocorticoids: relevance of 
glucocorticoid receptor isoforms and drug metabolites. 

Sptka HammerS. Kleuser B , Korting HC . Schafer-Kortmg M 

Institut fur Pharmazie, Abteilung fur Pharmakologie und Toxikologie, Freie Universitat 
Berlin, Berlin, Germany. 

As compared to standard glucocorticoids (GC), prednicarbate (PC) is favorable in the 
treatment of eczema due to its high benefit/risk ratio. The remarkable anti-inflammatory 
effects of PC are in strong contrast to its reported low glucocorticoid receptor (GR) 
binding affinity. In transfected COS-7 cells we related the transcriptional potencies of 
PQ its metabolites and conventional GC to their receptor binding properties. Moreover, 
the expression pattern of the human GR isoform hGRalpha and its mutual dominant 
negative inhibitor hGRbeta in skin cells have been investigated as well as the influence of 
hGRbeta on receptor binding and transactivation. hGRalpha mRNA and protein was 
largely overexpressed in skin cells. hGRbeta showed no influence on hGRalpha binding 
and transactivation. Concentration response curves indicated the greater transactivation 
potency of betamethasone 17- valerate followed by dexamethasone and prednisolone 17- 
ethylcarbonate. Native PC appeared almost as potent as dexamethasone. With both a 
strong correlation was observed between transactivation and GR binding. Copyright 2003 
S. Karger AG, Basel 
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Specific inhibition of AQP1 water chauuels in isolated rat intrahepatic bile 
duct units by small interfering RNAs, 

Splinter PL , Masyuk AI . LaRusso NF . 

Center for Basic Research ia Digestive Diseases, Division of Gastroenterology and 
Hematology, Mayo Medical School, Clinic, and Foundation, Rochester, Minnesota 55905 
USA. ' 

Cholangiocytes express water channels (i.e, aquaporins (AQPs)), proteins that are 
increasingly recognized as important in water transport by biliary epithelia. However, 
direct functional studies demonstrating AQP-mediated water transport in cholangiocytes 
are limited, in part because of the lack of specific AQP inhibitors. To address tins issue, 
we designed, synthesized, and utilized small interfering RNAs (si RNAs) selective for 
AQP I and investigated their effectiveness in altering AQP I -mediated water transport in 
intrahepatic bile duct units (IBDUs) isolated from rat liver. Twenty-four hours after 
transfection of IBDUs with siRNAs targeting two different regions of the AQP 1 
transcript, both AQP i mRNA and protein expression were inhibited by 76.6-92.0 and 
57.9-79.4%, respectively. siRNAs containing the same percent of base pairs as the 
AQP 1 -siRNAs but in random sequence (i.e. scrambled siRNAs) had no effect. 
Suppression of AQP I expression in cholangiocytes resulted in a decrease in water 
transport by IBDUs in response to both an inward osmotic gradient (200 mosm) or a 
secretory agonist (forskolin), the osmotic water permeability coefficient (P(f)) decreasing 
up to 58.8% and net water secretion (J(v)) decreasing up to 87%. A strong correlation 
between AQP I protein expression and water transport in IBDUs transfected with AQP 1- 
siRNAs was consistent with the decrease in water transport by IBDUs resulting from 
AQP1 gene silencing by AQP 1 -siRNAs. This study is the first to demonstrate the 
feasibility of utilizing siRNAs to specifically reduce the expression of AQPs in epithelial 
cells and provides direct evidence of the contribution of AQP 1 to water transport by 
biliary epithelia. 
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Type IV collagenase (M(r) 72,000) expression in human prostate: benign and 
malignant tissue. 

Stearns MIL Wang IVf . 

Department of Pathology, Medical College of Pennsylvania, Philadelphia 191 29. 

The expression of type IV collagenase (M(r) 72,000) has been examined in tissues from patients 
with benign prostatic hyperplasia (6 patients) and varying Gleason grades of malignant prostate 
cancer (1 8 patients). Immunoperoxidase labeling indicated that expression of the type IV 
collagenase was weak or nonexistent in benign tissue but consistently strong- in the glandular and 
ductal epithelial cells of prostate tumors diagnosed at Gleason grades 1-8. [n moderate to 
advanced cancer (Le., Gleason grades 2 to 8), invasive tumor foci in the stromal tissue produced 
relatively modest amounts of type IV collagenase. The normal stromal tissue (Le,, fibroblasts) 
uniformly failed to produce detectable levels of type IV collagenase in the 24 patients examined. 
Northern and quantitative slot blot hybridization assays demonstrated that collagenase type [V 
mRNA levels were low in benign tissue and high in malignant tumors. In contrast, the stromal 
cells did not express significant amounts of type IV collagenase mRNA. En2yme- linked 
immunosorbent assays demonstrated that the amounts of type IV collagenase protein correlated 
directly with the mRNA levels in the tumor tissue. The studies suggest that type IV collagenase 
may be selectively overexpressed by malignant, preinvasive prostatic epithelial cells. 
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The decompensated detrusor III: impact of bladder outlet obstruction on 
sarcoplasmic endoplasmic reticulum protein and gene expression. 

Stein R , Gong C. Hutchesou JC Canning DA, Zderic S A . 

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, 
USA. 

PURPOSE: Regulation of calcium ion homeostasis has a significant role in smooth 
muscle contractility. The sarcoplasmic endoplasmic reticulum, calciunii magnesium, 
adenosine triphosphatase (SERCA) is a regulatory ion pump that may have a role in the 
functional outcome after outlet obstruction. We investigate what correlation if any existed 
between SERCA protein and gene expression, and the contractile properties in the same 
bladder. MATERIALS AND METHODS: Standardized partial bladder outlet 
obstructions were created in adult New Zealand white rabbits, which were divided into 
control, sham operated and obstructed groups. Muscle strip studies subcategorized the 
obstructed group into compensated (force greater than 50% of control) and 
decompensated (force less than 50% of control). Microsomal membrane and total RNA 
fractions were prepared from the same bladder tissue. Membrane proteins were used for 
Western blot analysis using a SERCA specific monoclonal antibody, and total RNA was 
assessed with Northern blot analysis. RESULTS : The relative intensities of signals for 
the Western and Northern blots demonstrated a strong correlation between protein and 
gene expression. Furthermore there was a strong association between the loss of SERCA 
messenger RNA and protein expression and loss of bladder function. CONCLUSIONS; 
Bladder contractility after outlet obstruction is influenced in part by smooth muscle cell 
ability to maintain calcium homeostasis via SERCA, The loss of SERCA protein 
expression is mediated by down-regulation in gene expression in the same bladder. These 
data suggest that smooth muscle ion pump gene expression is in part mechanically 
(pressure work) regulated. 
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TNF-alpha and IL-8 arc upregulatcd in the epidermis of aormal human 
skin after LfVB exposure: correlation with neutrophil accumulation and E- 
selectiu expression. 

Strickland ( , Rhodes LE , Flanagan BF, Friedmann PS . 

Department of Dermatology, University of Liverpool, United Kingdom. 

The in vivo response to ultraviolet B (UVB) radiation in skin is characterized by die 
accumulation of both mononuclear and polymorphonuclear cells within the dermis and an 
induction of vascular endothelial adhesion molecules. Epidermal production of cytokines 
(IL-8 and TNF-alpha) has been strongly implicated in the development of UVB-induced 
inflammation. In the current study, we examined the time course of IL-8 and TNF-alpha 
mRNA and protein expression in the epidermis over a 24-h period after in vivo UVB 
irradiation. Also, the induction of adhesion molecule expression and the accumulation of 
neutrophils within the dermis were followed. We found constitutive expression of both 
cytokines (mRNA and protein) in the epidermis of unirradiated skin. IL-8 was rapidly 
upregulated after irradiation and mRNA and protein increased at 4 h, reaching a 
maximum between 8 and 24 h. TNF-alpha mRNA and protein was minimally increased 
by 8 h after UVB irradiation and reached a maximum by 24 h. No significant alteration in 
ICAM-l or VCAM-l expression was observed. E-selectin expression, which was absent 
from control samples, was increased from 4 h onward and also reached a maximum at 24 
h, coinciding with peak neutrophil accumulation. A strong correlation (r = 0.96) was 
found between number of E-selectin-positive vessels and numbers of infiltrating 
neutrophils at this time. Moreover, because E-selectin expression was increased before 
any apparent increase in TNF-alpha protein (4 h), TNF-alpha does not appear to be 
involved in the early induction of the adhesion molecule, but cytokines such as TNF- 
alpha and IL-8 may act subsequently to augment the inflammatory response. 
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Basic fibroblast growth factor expression is increased in human renal 
fibrogenesis and may mediate autocrine fibroblast proliferation. 

StrutaF, Zeisberg M , Bemmerlein B, SartJer B. Hummel K Becker V. Muller GA . 

Department of Nephrology and Rheumatology, Georg- August-University Gottingen, 
Germany, fstrutz@gwdg.de 

BACKGROUND: Interstitial fibroblasts play a critical role in renal fibrogenesis, and 
autocrine proliferation of these cells may account for continuous matrix synthesis. Basic 
fibroblast growth factor (FGF-2) is mitogenic for most cells and exerts intracrine, 
autocrine, and paracrine effects on epithelial and mesenchymal cells. The aims of the 
present studies were to localize and quantitate the expression of FGF-2 in normal and 
pathologic human kidneys and to study the in vitro effects of FGF-2 on proliferation, 
differentiation, and matrix production of isolated cortical kidney fibroblasts. METHODS: 
FGF-2 protein expression was localized by immunofluoresence double labelings in 
normal and fibrotic human kidneys. Subsequently, interstitial FGF-2 labeling was 
determined, semiquantitatively in 8 normal kidneys and 39 kidneys with variable degrees 
of interstitial fibrosis and was correlated with the morphometrically determined 
interstitial cortical volume. In addition, FGF-2 expression was quantitated by immunoblot 
analysis in three normal and six fibrotic kidneys. FGF-2 mRNA was localized by in situ 
hybridizations. Seven primary cortical fibroblast lines were established, and expression of 
FGF-2 and FGF receptor-] (FGFR-1) were examined. The effects of FGF-2 on cell 
proliferation were determined by bromodeoxyuridine incorporation and cell counts, those 
on differentiation into myofibroblasts by staining for alpha-smooth muscle actin, and 
those on matrix synthesis by enzyme-linked immunosorbent assay for collagen type I and 
fibronectin. Finally, proliferative activity in vivo was evaluated by expression of MIB-1 
(Ki-67 antigen). RESULTS: In normal kidneys, FGF-2 expression was confined to 
glomerular, vascular, and a few tubular as well as interstitial fibroblast-Iike cells. The 
expression of FGF-2 protein was increased in human kidneys, with tubulointerstitial 
scarring correlating with the degree of interstitial fibrosis (r = 0.84, P < 0.01 ). 
Immunoblot analyses confirmed a significant increase in FGF-2 protein expression in 
kidneys with interstitial scarring. In situ hybridization studies demonstrated low-level 
detection of FGF-2 mRNA in normal kidneys. However, FGF-2 mRNA expression was 
robustly up-regulated in interstitial and tubular cells in end-stage kidneys, indicating that 
these cells are the source of excess FGF-2 protein. Primary cortical fibroblasts express 
FGF-2 and FGFR-1 in vitro. FGF-2 induced a robust growth response in these cells thai 
could be blocked specifically by n neutralizing FGF-2 antibody. Interestingly, the 
addition of the neutralizing antibody alone did reduce basal proliferation up to 3] .5%. In 
addition, FGF-2 induced expression of alpha-smooth muscle actin up to 3 .6- fold, but no 
significant effect was observed on the synthesis of collagen type 1 and fibronectin. 
Finally, staining for MIB-1 revealed a good correlation of interstitial FGF-2 posilivity 



with interstitial and tubular proliferative activity (r = 0.71 , P < 0.01 for interstitial 
proliferation, N = 30). CONCLUSIONS: Interstitial FGF-2 protein and mRNA 
expression correlate with interstitial scarring. FGF-2 is a strong mitogen for cortical 
kidney fibroblasts and may promote autocrine fibroblast growth. Expression of FGF-2 
correlates with interstitial and tubular proliferation in vivo. 
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Adiposity elevates plasma MOM levels leading to the increased CD1 lb- 
positive monocytes in mice. 

Takahashi K. Mizuarai S . Araki H , Maskiko S . hhihara A . Kanatani A . Kadani H 
Kotani H . 

Banyu Tsulcuba Research institute in collaboration with Merck Research Laboratories, 
Tsukuba r Ibaraki 300-261 1, Japan. 

Obesity is currently considered as an epidemic in the western world, and it represents a 
major risk factor for life-threatening diseases such as heart attack, stroke, diabetes, and 
cancer. Taking advantage of DMA microarray technology, we tried to identify the 
molecules explaining the relationship between obesity and vascular disorders, comparing 
mRNA expression of about 12,000 genes in white adipose tissue between normal, high 
fat diet-induced obesity (DIO) and d-Trp34 neuropeptide Y-induced obesity in mice. 
Expression of monocyte chemoattractant protein-l (MCP-I) mRNA displayed a 7.2-fold 
increase in obese mice as compared with normal mice, leading to substantially elevated 
MCP-1 protein levels in adipocytes. MCP-l levels in plasma were also increased in DIO 
mice, and a strong correlation between plasma MCP-1 levels and body weight was 
identified. We also showed that elevated MCP-1 protein levels in plasma increased the 
CD1 lb-positive monocyte/macrophage population in DIO mice. Furthermore, infusion of 
MCP-1 into lean mice increased the CD 1 lb-positive monocyte population without 
inducing changes in body weight/Given the importance of MCP-1 in activation of 
monocytes and subsequent atherosclerotic development, these results suggest a novel role 
of adiposity in the development of vascular disorders. 
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Augmented expression of ueuronal nitric oxide syntliase in Hie atria 
parasympathetically decreases heart rate during acute myocardial 
infarction in rats. 

Takimoto Y , Aoyatna T , Tanaka K, Kevamura R , Vui_V, Sasayama S . 

Department of Cardiovascular Medicine, Graduate School of Medicine, Kyoto 
University, Kyoto, Japan. 

BACKGROUND: Nitric oxide (NO) synthesized within sinoatrial cells recently has been 
shown to participate in the autonomic control of heart rate. We hypothesized that NO in 
the neuronal cells in the heart was increased and parasympathetically regulated heart rate 
after myocardial infarction (MI). METHODS AND RESULTS: We examined heart rate 
dynamics and neuronal NO synthase (nNOS) expression and activities in the atria of rats 
with MI 1, 3, 7, and 14 days after MI (n-7 to 22 for each group). Both die mRNA levels 
of nNOS in the atria determined by competitive reverse transcriptase-polymerase chain 
reaction and the protein levels determined by Western blotting were significantly 
increased compared with controls 1,3, and 7 days after ML nNOS activity in the atria I 
day after infarction was also increased in MI rats. nNOS immunoreactivity was observed 
in nerve fibers in the atria. After infusion of a specific inhibitor of nNOS and iNOS, l-(2- 
trifluoromethylphenyl) imidazole (TRIM) (50 mg/kg IV), heart rate was significantly 
(P<0.01) increased in MI rats compared with controls l 3 3, and 7 days after ML The 
iNOS-specific inhibitor, 1400W (10 mg/kg SC), did not significantly affect the heart rate 
in rats with ML The effect of TRIM was abolished by pretreatment.with L-arginine (25 
mg/kg IV) or by parasympathetic blockade with atropine but not by propranolol. There 
was a strong correlation (r=0.837 t PO.0001) between the nNOS protein expression and 
heart rate change after TRIM infusion. CONCLUSIONS: These results indicate that . 
increased nNOS parasympathetically decreased heart rate via the production of NO in 
rats with acute ML 



PMID: 1 1815433 [PubMed - indexed for MEDLINE] 



210: J Sur g Res. 20 01 Mar;96(l ): 56-67.- Related Articles, Links 

Differential upregulation of cellular adhesion molecules at the sites of 
oxidative stress in experimental acute pancreatitis, 

Tclek G, Ducroc R , Scoazcc JY. Pasquicr C . Feldmann G . Roze C . 

INSERM U 410, Universite Paris 7 Denis Diderot, 75870 Paris, France. 

BACKGROUND: Severe acute pancreatitis (AP)(2) is associated with exaggerated 
leukocyte adherence and activation. Endothelial cellular adhesion molecules (CAMs) can 
be induced by cytokines, but also directly by oxygen free radicals (OFRs), mediated by 
nuclear factor kappa-B (NF-kappa B). We investigated die behavior of inducible CAMs 
in relation to pancreatic oxidative stress. Our novel modification of cerium capture 
histochemistry (reaction of OFRs witfi cerium produces laser reflective Ce perhydroxide 
precipitates) combined with reflectance confocal laser scanning microscopy (CLSM) 
allo ws the histological codemonstration of in vivo OFR production and imrnunolabeled 
CAMs, or NF-kappa B. METHODS; Taurocholate AP was induced in rats; sham 
operated and normal animals served as controls. To achieve in situ, in vivo reaction of 
cerium with OFRs, animals were perfused with CeCl(3) solution at different time points 
(I, 2, 8, 24 h) and then sacrificed. E-selectin, P-selectin, ICAM-1, VCAM, and NF-kappa 
B p65 were labeled by immunofluorescence (IF) on frozen sections of cerium perfused 
pancreata- IF and Ce perhydroxide reflectance were simultaneously detected by CLSM, 
Pancreatic gene expression of the same CAMs was quantified by competitive RT-PCR 
(MIMIC internal control). RESULTS: Control pancreata showed negligible reflectance 
and minimal CAM expression. Early (1, 2 h) AP samples were characterized by intense, 
heterogeneous acinar OFR production, strong P-selectin, and increasing [CAM 
expression, with nuclear translocation of p65, histologically all colocalizing with the 
areas of acinar oxidative stress. Adherent polymorphonuclear leukocytes (PMNs) 
displayed weak OFR formation. Later (8, 24 h), a slowly declining P-seiectin, but 
persisting ICAM-1 expression, was paralleled by widespread adherence of PMNs 
producing surprisingly large amounts of OFRs. VCAM and E-selectin showed a mild 
increase at 24 h. CAM gene activation was in good correlation with the protein 
expression. CONCLUSIONS: The early acinar oxidative stress is co localized with NF- 
kappa B activation, preferential P-seiectin, and 1CAM upregulation in this AP model. 
Subsequently, adherent, activated PMNs become the major source of OFRs, thereby 
contributing to tissue damage. Copyright 2001 Academic Press. 
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Myotonic dystrophy: an unstable CTG repeal in a protein kinase gene. 
Timchenko L . Moncktqn DG, Caskey CT , 

Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 
Medical Center, Houston 77030, USA. 

Myotonic dystrophy (DM) is caused by the amplification of CTG repeats in the 3 f 
untranslated region of a gene encoding a protein homologous to serine/threonine protein 
kinases. In DM patients the CTG repeats are extremely unstable, varying in length from 
patient to patient and generally increasing in length in successive generations. There is a 
strong correlation between the size of the repeats and the age of onset and severity of the 
disease. The molecular basis of the effect of the CTG expansion on the development of 
the DM phenotype continues to be investigated. The first working hypothesis of the 
molecular mechanism of DM was a reduction in steady-state myotonin-protein kinase 
(Mt-PtC) mRNA and protein levels. However, although the consensus finding is diat the 
Mt PK mRNA and protein levels are decreased in DM patients, it is still not clear if this 
reduction leads directly to the DM phenotype. In this short review we discuss the 
molecular aspects of CTG instability and the expression of the myotonin-protein kinase 
gene in normal and DM populations. 
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Induction of class 3 aldehyde dehydrogenase in the mouse hepatoma cell 
line Hepa-l by various chemicals. 

Torroncn R , Kot kalaineii M . ECarcnlampi SO . 

Department of Physiology, University of Kuopio, Finland; 

The mouse hepatoma cell line Hepa-l was shown to express an aldehyde dehydrogenase 
(ALDH) isozyme which was inducible by TCDD and carcinogenic polycyclic aromatic 
hydrocarbons. The induced activity could be detected with benzaldehyde as substrate and 
NADP as cofactor (B/NADP ALDH). As compared with rat liver and hepatoma cell 
lines, the response was moderate (maximally 5 -fold). There was an apparent correlation 
between this specific form of ALDH and aryl hydrocarbon hydroxylase (AHH) in the 
Hepa-l wild-type cell line-in terms of inducibility by several chemicals. However, the 
magnitude of the response was clearly smaller for ALDH than for AHH. Southern blot 
analysis showed that a homologous gene (class 3 ALDH) was present in the rat and 
mouse genome. The gene was also expressed in Hepa-l and there was a good correlation 
between the increase of class 3 ALDH-specific mRNA and B/NADP ALDH enzyme 
activity after exposure of the Hepa-l cells to TCDD. It is concluded that class 3 ALDH is 
inducible by certain chemicals in the mouse hepatoma cell line, although the respective 
enzyme is not inducible in mouse liver in vivo. 
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Relationship between cyciin Dl and p2l(Wafl/Cipl) during differentiation 
of Iiutnaii myeloid leukemia cell lines, 

Ullmannova V , Stockbaucr P , Hradcova M Soucek J , Haskovec C . 

Department of Molecular Genetics, Institute of Hematology and Blood Transfusion, U 
Nemocnice 1, 128 20 Prague 2, Czech Republic, ullmjan@uhkt.cz 

Expression of cell cycle-regulating genes was studied in human myeloid leukemia cell 
lines ML-1 , ML-2 and ML-3 during induction of differentiation in vitro. 
Myelomonocytic differentiation was induced by phorbol ester (12-o-Tetradecanoyl- 
phorbol~13-acetate,TPA), tumor necrosis factor alpha (TNFalpha) or interferon gamma 
(INFgarnma), or their combination. Differentiation (with the exception of TNFalpha 
alone) was accompanied by inhibition of DN A synthesis and cell cycle arrest. Inhibition 
of proliferation was associated with a decrease in the expression of cdc25A and cdc25B, 
cdk6 and Ki-67 genes, and with increased p21(Wafl/Cipi) gene expression, as measured 
by comparative RT-PCR. Expression of the following genes was not changed after 
induction of differentiation: cyciin Al, cyciin D3, cyciin El and p27(Kipl). Surprisingly, 
cyciin D 1 expression was upreguiated after induction by TP A, TNFalpha with 
.IFNgamma or BA. Cyciin D2 was upreguiated only after induction by BA. The results of 
the expression of the tested genes obtained by comparative RT-PCR were confirmed by 
quantitative real-time (RQ) RT-PCR and Western blotting. Quantitative RT-PCR showed 
as much as a 288-fold increase of cyciin Dl specific mRNA after a 24h induction by 
TP A. The upregulation of cyciin Dl in differentiating cells seems to be compensated by 
the upregulation of p2 l(Wafl/Cip I ).These results, besides others, point to a strong 
correlation between the expression of cyciin Dl and p21(Wafl/Cipl) on the one hand 
and differentiation on the other hand in human myeloid leukemic cells and reflect a rather 
complicated network regulating proliferation and differentiation of leukemic cells. 
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Intestinal carbamoyl phosphate synthase I in human and rat Expression 
during development shows species differences and mosaic expression in 
duodenum of both species. 

Van Beers EH; -Rin gs EH , Posthuma G . Dingemanse MA . Taminiau JA , Heymans 
HS, Etnerhand AW , Bullcr HA , Dckker J . 

Pediatric Gastroenterology and Nutrition, Department Pediatrics, Emma Children's 
Hospital, Academic Medical Center, Amsterdam, The Netherlands. 

The clinical importance of carbamoyl phosphate synthase I (CPS!) relates to its capacity 
to metabolize ammonia, because CPSI deficiencies cause lethal serum ammonia levels; 
Although some metabolic parameters concerning liver and intestinal CPSI have been 
reported, the extent to which enterocytes contribute to ammonia conversion remains 
unclear without a detailed description of its developmental and spatial expression 
patterns. Therefore, we determined the patterns of enterocytic CPSI mRNA and protein 
expression in human and rat intestine during embryonic and postnatal development, using 
in situ hybridization and irnmunohistochemistry. CPSI protein appeared during human 
embryogenesis in liver at 31-35 e. d. (embryonic days) before intestine (59 e.d.), whereas 
m rat CPSI detection in intestine (at 16 e.d,) preceded liver (20 e.d.). During all stages of 
development mere was a good correlation between the expression of CPSI protein and 
mRNA in the intestinal epithelium. Strikingly, duodenal enterocytes in both species 
exhibited mosaic CPSI protein expression despite uniform CPSI mRNA expression in the 
epithelium and the presence of functional mitochondria in all epithelial cells. Unlike rat, 
CPSI in human embryos was expressed in liver before intestine. Although CPSI was 
primarily regulated at the transcriptional level, CPSI protein appeared mosaic in the 
duodenum of both species, possibly due to post-transcriptional regulation. 
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Expression of deoxycytidine kinase in leukaemic cells compared with solid 
tumour cell lines, liver metastases and normal liver. 

vander Wilt CL, Kroep jR, Loves WJ , RotsMG T Van Groeniagen CL Kaspers GJ, 
Peters GJ . 

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several 
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents. 
Examples include cytosine arabinoside (Ara-C) and 2-chlorodeoxyadenosine (CdA) in 
the treatment of acute myeloid leukaemia (AML) and gemcttabine to treat solid tumours. 
In this study, expression of dCK mRNA was measured by a competitive template reverse 
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 1 6 childhood and adult AML samples, 1 0 human liver samples and 1 1 
human liver metastases of colorectal cancer origin. The enzyme activity and protein 
expression levels of dCK in the cell lines were closely related to the mRNA expression 
levels (r=0.75, P=0.026 and r=0.86, P=O.007): In AML samples, dCK mRNA expression 
ranged from L 1 6 to 35.25 (x 1 0(-3)xdCK/beta-actin). In the cell line panel, the range was 
2.97-56.9 (xlO(-3)xdCK/beta-actin) of dCK mRNA expression. The enzyme activity in 
liver metastases was correlated to dCK mRNA expression (r=0.497, P=0.05). In the liver 
samples, these were not correlated. dCK mRNA expression showed only a 36-fold range 
in liver while a 150-fold range was observed in the liver metastases. In addition, dCK 
activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver 
samples. Since dCK is associated, with the sensitivity to deoxynucleoside analogues and 
because of the good correlation between the different dCK measurements in malignant 
cells and tumours, the CT-RT PCR assay will be useful in the selection of patients that 
can be treated with deoxycytidine analogues. 
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Expression of cytokines and growth factors in human 
glomeruloucphritides. 

Waldherr R , Noronha IL , Ntemir Z , Kruger C Stein H , Stunim G 
Department of Pathology, University of Heidelberg, Germany. 

Numerous experimental studies point to the potential role of cytokines and growth factors 
in the pathogenesis of renal disease. However, from the various autocrine and paracrine 
mediators identified in vitro and in animal models, so far only a few have been 
demonstrated in selected human glomerulopathies. We examined two types of 
glomerulonephritis (GN): extracapillary GN with anti-neutrophil cytoplasmic 
autoantibodies (ANCA), an example of an acute form of GN, and mesangial IgA GN, 
usually a chronic form of GN, with immunocytochemistiy, in situ hybridization and the 
polymerase chain reaction. Normal renal tissue from tumour nephrectomies served as a 
control. In ANCA-positive GN with active renal lesions (crescents, glomerular and 
vascular necrosis), infiltrating mononuclear cells in glomeruli and in the interstitium 
expressed interleukin (IL)-1 beta, tumour necrosis factor (TNF)-alpha, IL-2, interferon 
(IFN)-gamma, platelet-derived growth factor (PDGF) and transforming growth factor 
(TGF)-beta. Cytokine expression was also observed in activated resident cells, including 
endothelial cells, capsular epithelial cells, smootfi muscle cells of vessel walls, fibroblasts 
and some tubular epithelial cells. In addition, we noted an increase in the cytokine and 
growth factor receptors TNF-R, IL-1R type II, IL-2R, IFN-gamma R and PDGF beta-R^ 
In contrast, in mesangial IgA-GN, IL-l beta, TNF-alpha, IFN-gamma and IL-2 were 
usually absent in glomeruli. Mesangial expansion in this disorder was accompanied by an 
increased expression of PDGF, PDGF beta-R, TGF-beta and IL-6 in mesangial areas. In 
both conditions a good correlation was observed between cytokine expression at the 
mRNA (in situ hybridization) and protein level (immunocytochemistry).( ABSTRACT 
TRUNCATED AT 250 WORDS) 
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Malignant transformation of the human endometrium is associated with 
overexpressiou of lactoferrin messenger RNA and protein. 

Walmer QIC Padin CJ , Wrona MA , If eat y BE , Bentley RC Tsao MS , Koliler MF . 
McLachlan J A . Gray KD. 

Department of Obstetrics and Gynecology, Duke University Medical Center, Durham, 
North Carolina 277 10. ' 

In the mouse uterus, lactoferrin is a major estrogen-inducible uterine secretory protein, 
and its expression correlates directly with the period of peak epithelial cell proliferation. 
In this study, we examine the expression of lactoferrin mRNA and protein in human 
endometrium, endometrial hyperplasias, and adenocarcinomas using 
irruriimohistochemistry, Western immunoblotting, and Northern and in situ RNA 
hybridization techniques. Our results reveal that lactoferrin is expressed in normal 
cycling endometrium by a restricted number of glandular epithelial cells located deep in 
the zona basalis. Two thirds (8 of 12) of the endometrial adenocarcinomas examined 
overexpress lactoferrin. This tumor-associated increase in lactoferrin expression includes 
an elevation in the mRNA and protein of individual cells and an increase in the number 
of cells expressing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also observe distinct 
cytoplasmic and nuclear immunostaining patterns under different fixation conditions in 
both normal and malignant epidielial cells, similar to those previously reported in the 
mouse reproductive tract. Serial sections of malignant specimens show a good correlation 
between die localization of lactoferrin mRNA and protein in individual epithelial cells by 
in situ RNA hybridization and immunohistochemistry. Although the degree of lactoferrin 
expression in the adenocarcinomas did not correlate with the tumor stage, grade, or depth 
of invasion in these 12 patients, there was a striking inverse correlation between the 
presence of progesterone receptors and lactoferrin in all 8 lactoferrin-positi ve 
adenocarcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which is not shed with menstruation and is 
frequently overexpressed by progesterone receptor-negative cells in endometrial 
adenocarcinomas. 
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ABSTRACT 

* Id the mouse uterus, lactofemn is a major eslrngen-inducible uterine 
secTetorr protein, and its expression cor relates direelly with ibe period of 
peak epithelial cell proliferation. In this study, we examine the expression 
' of lactoferrin mRNA »nd protein In human endometrium, endometrial 
hyperplasias* and adenocarcinomas using immunohfetochtrnistry, West- 
ern immunoblolting, and Northern and in sitv RNA hybridisation tech- 
niqnes. Our results reveal that lactoferrin is expressed in normal cycling 
endometrium by a restricted number of glandular epithelial cells located 
deep in the zona basalis. Two thirds (8 of 12) of the endometrial adeno- 
carcinomas examined over express lactoferrin. This tumor-associated in- 
crease in lactoferrin expression includes an elevation m the mRNA and 
pTOtein of individual cells and an increase In the number of cells express- 
ing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in factofenm. Wt also 
observe distinct cytoplasmic and nuclear immunostalning patterns under 
different fixation conditions in both normal and malignant epithelial cells, 
similar to those previously reported in the mouse reproductive trsct- 
Serial sections of matignant specimens show a good correlation between 
the localization of lactoferrin mRNA and protein in individual epithelial 
ceils by in situ RNA hybridization and immunohisiocbunistry. Although 
the degre* of lactoferrin expi ession in ih* adenocarcinomas . did not 
correlate with ibe tumor stage, grade, or depth or invasion in these 12 
patients, there was a striking inverse corr elation between the presence of 
progesterone receptors and lactoJerrin in all 8 laciofrrrin-positive adeno- 
carcinomas. In summary, larioferrin is expressed in a region of normal 
endometrium known as the zona basalis which 1$ not shed with menstru- 
ation and is frequently overexpressed by progesterone receptor-negative 
cells in endometrial adenocarcinomas. 



INTRODUCTION 

"The uterus is a sex stcioid- responsive organ that plays a major ioJc 
in women's health. Hysterectomies wcic the most frequently per- 
formed roajoi surgical procedures in a 20-year study interval 
(1965-1984; Kef. 1). Fifty-eight to 80% of ihesc 12.5 million 
procedures were perfoimed foi esiro°en-r elated disorders of prolifer- 
ation. Chronic unopposed estrogen ejrposuje, roost commonly asso- 
ciated with type II ovulatory disordejs, eventually leads to the devel- 
opment of complex endometrial hyperplasia and adenocarcinoma. 
Since the sex steroids, estrogcD and progesterone, act on their target 
tissues by regulating the expression of a wide variety of signaling 
molecules, identifying these regulatory fsctois will provide critical 
information towards under standing normal jepjoducuoD and repro- 
ductive tract pathology. Our cunent knowledge of esnogen and pro- 
gesterone action on the reproductive tract is based to a gjeat extent on 
information collected from todents (2). Although ditfetences exist 
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between the. reproductive physiology of rodents and humans, the 
mouse bas dccd a useful model for studying steroid hormone action to 
the human female repioduclive tract (3, 4). One potential regulatory 
molecule shown to be regulated by estrogen in the mouse reproductive 
tract is lacto/errin. Ladofenin is a basic glycoprotein with an extraor- 
dinarily high affinity for iron that was originaUy discovered in mUk. 
This protein is expressed in a wide variety of tissues, roost notably to 
polymorphonuclear leukocytes and. roost mammalian exocrine glan- 
dular secretions. In the mammary gland (5) and ibe female reproduc- 
tive tract of the mouse (6-8), lado/enin is regulated by endocrine 
hormones. Prolactin stimulates lactoferrin synthesis in the breast; 
whereas in the uterus and vagina, the ovarian seat steroid, 170- 
estradiol, is the inducer. (6, X 9). To date, lactoferrin is one of the few 
genes that have bcefi identified in the rodent that are directly regulated 
by estradiol. The Jactofenin gene contains an ERE 2 that is important 
for. jegulating its expression in vivo in the mouse reproductive tract 
Being linked io esnadiol, the expression of lactoferrin by the uterine 
epithelium parallels the onset of DRA synthesis. Although sequencing 
information suggests that Ibe human lactoferrin gene also contains a 
functional imperfect ERE in the 5'-flaraHrjg promoter region (10, 11^ 
there is very little data regarding lactoferrin expression in the human 
female reproductive tract; 

The purpose of our study was to examine the expression of lacto- 
ferrin in the human endometrium under normal and pathological 
conditions by immuDohistocheroistry, immunoblotling, and Northern 
and in situ RNA hybridization techniques. In addition, we looked fox 
con clarions between lactoferrin expression and several parameters, 
such as the stage of the menstrual cycle, the distribution of estrogen 
and progesterone receptors, HER-2/new expression, markers of cell 
proliferation, and the histopathologicaj grade and extent of myome- 
irial invasion in ibe adenocarcinomas- Our data demonstrates that 
lactoferrin is expressed in a very restricted Dumber of glands in the 
basal region of normal human endometrium and is markedly over ex- 
pressed in a significant number of the uleiirre adenocarcinomas by 
PR-negative celfs. 



MATERIALS AND METHODS 

Tissue Preparation ond Histological Equation. Surgical pathology 
specimens were obrameri bom Duke Univerriiy Medical Cemei (Durham, NQ 
and the Depajtmenr of PaiboJogy at Mod Deal GeDeial Hospital (Quebec, 
Mostjtal, C^r.sda). Cycling eodorocliiuin was obtained j»oro 22 women (age* 
3t-49) r and auopbic endometrium was obtained bom 7 postmenopausal 
women (age* 64-77). HysttittiOTme;- lioni cycling women wtte performed 
ioj substjosal leiomyomas (n = 6). pctvjc itJajxaiioD f> - 8), pelvic paw» 
(n = 4), pciHoneal endometriosis (n - 2), and cancer of either the exocervw 
In = 1) ot the ovary (n = J). In addition, 12 adenocarcinomas, 3 * typical 
complex hyperplasias, 5 compkJ hyperplasias without atypia, and .4 simple 
bvpejplasits were analyzed. Each human uterus wa* bivalvcd sbordy aitei 
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hysterectomy, and endometrium was removed from tbc fundal region. A full 
tbic)c»ess biopsy was placed into cither 10% neutral-buffered formalin or 
£ouin s solution overnight at room temperature before dehydration, paraffin 
embedding, and sectioning at 4 /un on silarrtted slides. Histological cvalua- 
do»s of hematoxylin and eosin-staincd slides were performed blindly by one 
board-certified pathologist. Norma! endometrial samples were dated by die 
criteria of Noyes ci oL (12). Endometrial hyperplasias and carcinomas were 
classified according to the current recommendations, of the International 
Society of Gynecological Pathologists under the auspices of WHO (13). 
Histological grading of tumors was performed according to Federation Inter* 
pationale des Gynecologist ts et Obstetristes criteria (]4)l Each specimen was 
ttzd a minimnro of three times, and only specimens that were read consistently 
the same way were included in the study. Unstained sections of the same 
tissues were used foi the cylochemical analysis of protein and roRNA expres- 
stOD using specific reagents. A few endometrial samples were frozen for 
subsequent protein and RNA extraction, which were evaluated by Western and 
>lojlbern blotting, respectively. Al) human tissues were handled with the 
precautions and the guidelines requijed by Duke University and National 
Institute of Environmental Health Sciences. 

ImmunolocaKiation. Slides chosen for srudy woe depa/affimzed and 
rinsed in 20% glacial acetic add at 4"Cfor 15 s to inhibit endogenous alkaline 
phosphatase. All subsequent incubations and washes were at room tempera- 
ture. Sections were next equilibrated in PBS for 20 roin and incubated for 20 . 
roin with 1-5% normal goal serum dfluted in PBS to block nonspecific binding. 
Detection of lactoferrin was performed primarily with a rabbit anti- human 
lactoferrin polyclonal antisera generated in our laboiatory and affinity purified. 
Similar results were also seen with a d on affinity-purified commercial antisera 
(Brogcntx, San Ramon, Ca). Following incubation at room temperature with 
primary antisera for 60 roin, the seeiions weie washed in PBS twice for 10 min 
each, and lactoferrin was located using an alkaline phosphatasc-biotiw- 
sueptavidin detection system (Vectastain ABC-AP kit: Vector Laboratories, 
BurJingame, CA; or the Super Sensitive Detection System: Biogenex, San 
Ramon, CA). To identify nonspecific staining, pre immune rabbit serum was 
used in place of the primary antibody. The immunoteaclion was quantitated by 
determining the per cent age of glands and I be peiceniage of cells staining foi 
teciofenin in the 2ona basalts and the aorta funciionalis, with b minimum of 
300 cells counted in each region. PR antibody was provided by Geoffrey 
Greene (KD66), and a commercial soujcc was also used (Biogenex, San 
Ramon, CA). Identical staining pa herns were confirmed with both piepara- 
tions. Other commercially obtained anijscra include PCNA {Biogenex, San 
Ramon, CA), ER (EJUD5; AMAC, Westbrook, Ma), M1B-1 (AMAQ, and 
HEH-2/«ew (Biogenex, SaD Ramon, CA). The prim an- antisera incubations 
were 2 b for the PR> 1 b for PCNA, M1B-1. and ER, end 30 min for 
• HER -Uneu. Antisera dDutions were 1:100 Jor M1BO and 1:20 for HER-2/nen. 
Antigen reuievaJ (Biogenex, San Ramon, CA) was performed before adding 
the progesterone primary antisera. 

Western Blot Analysis. Pjoteins were extracted bom endometrial biopsies 
by homogenization on ice in J% TriibnOf and 20 mw Tris-HCJ (pH 7.4) with 
protease inhibitors (10 ng/ml Icupcpiin. 200 KUVml apfOtinin, and 20 pg/ml 
phenylmetbylsulfonyl fluoride) and clarified by cenmfugadon al 45,000 rpm 
for 30 min in a Beckman 70.1 Ti ioioi: then the supernal an) was analyzed for 
protein concentration by ibe BCA pioitin assay (Pierce, Rotkford, IL). Al> 
quois of 200 fig were separated by electrophoresis on an 83% SDS polyac- 
lylamide gel, blotted onto nitroceDuIose ntembianes, incubated with poly- 
clonal rabbit antihujnan lacioferrin Bnbseia, and localized with an >75 1- labeled 
donkey anti-isbbit immunoglobulin, bi described previously (7). 

In Siru Hybridization. All slides werr pre Healed with 0.2 N HO for 50 
nun at room lemperaruTe, digesitd with 1 ^tg/ml proteinase -K (Sigma Chein- 
icai Co., St. Louis. MO) b O.r m Tiis-HCl (pH 7.4VG.Q5 u £DTA /or J 5 min 
21 37"C, and then treated wrlb 0.1 M nit ibanolanuue- 0.25% acetic anhydride 
lor 5 min at. room temperature and 0.1 m Tris-gJvcine (pH 7.4) Jot 30 min at 
tootn temperature. Tbt sections were lubiequr ntly dehydrated with gjaded 
fihauol, aii dried, and pre hybridized ai iO'C for J b in 7 > SSC 10 dim D1T, 
- * Dcnhajdt'* solution, 1 DO ptffn) of both salmon ipcrm DHA and ?eaM 
'^NA and 50% formamide (15). The slides were iben hvbrioized overnight at 
^0*C in the ianjc medium with 10% riexiian sulfate and 2*10* cprnZ/jJ of ibe 
J prcific RNA probe. The Isctofcrrin oligonucleotide pjobe wa> amplified by 
PCR using primers that spanned nucleotides 718-3654 (70) and cloned into 
pGEM-42. "S- labeled iense and antisensr RNA piobe? weie in»de wiib ibe 



Promega Riboprobe kit (Promcea, Madison, Wi*X washed twice in 1 X SSC foi 
JO mb at room temperature, digested with RNase [2J >g/ml RNase-A, 03 
^g/nJ KNase-TJ, 10 mM Tris-HO (pH 7.4), and 15 mM Nad}, and washed 
again with 1 X SSC twice for 20 min each time at 50°C, twice for 20 m 
0J X SSC al 55*C a»d once foi 20 vrio at 6tTC The sections were then 
dehydrated and dipped- m Kodak autoadSographrc emulsion (NTB-2) for 
dejection of specific mRNA tipression. The slides were allowed to develop fat 
2 weeks. After this period, the slides were developed using Kodak PJ9 
developer and Kodak Rapid Fixer. 

Northern RNA Aoarysis. Total cellular RNA was purified feom fioien 
tissue by the gua^dinc isotbiocyaDate-cesium chloride method, and polyCA*> 
RNA was isolated by oligo(dT><ellolose chromatography using methods 
described previously (15). For Northern blot analysis, po!y(A*>RNA*was 
resolved by electrophoresis on 13% formaldehyde agarose gels, stained with 
ethidium bromide, and transferred to a nylon membrane. The membrane was 
probed with a ^P-labeled bctoferrin cDNA derived from human trlerus (»»- 
deotides 718-1654; accession do. S52659) using PCR techniques, followed 
by inning into pGEM-42 (Promega, MadUon, WQ. In order to insme that the 
quality and quantity of RNA analyzed by Northern blottmg was equivalent 
between control and treated groups, the Hot was probed simultaneously for 
glyceraldehyde*3-phosph»le dehydrogenase. 

. Statistical Analysis. Values are presented as means ± SD. Differences 
between the zona basalis and funclionalis we tested by the two-uDed 
Student's r tesL • - 

RESULTS 

liirmunohistochemical Analysis of Lactoftrrin Protein 

Norma) Cycling Endometrium. Inunuw>hislocherjDical studies of 
normal cycling human endometrium localize lactofemn protein pre- 
dominantly to the glandular epithelium deep in the zona basalis and 
not to the functionalis (Fig. Tbc association of lactoferrin pTOtein 
expression with the zona basalis is statistically significant. (P < 0-001; 
Fig. 2). Two to 56% of the glands express lactofenin. al any given time 
during the menstrual cycle. Within positive glands, lactoferrin protein 
irnmunolocaJizatioij is heterogeneous in thai positively staining epi- 
thelial cells are interspersed with cells negative for lactoferrin expres- 
sion. No apparent differences in morphology, PCNA, ER, oj PR 
expression are seen to account for the heterogeneous pattern of intrar 
and intetgland lactoferrin expression in normal endometrium. Similar 
to our previous findings in mouse uterine epithelial cells, the positive- 
staining glandular cells of the human endometrium demonstrate two 
distinct iromunos taming patterns foi lactoferrin, cytoplasmic and nu- 
clear {Fig. W\ seen v*ith both formalin and Bourn's fixation. In 
evaluating the temporal expression of lactoferrin, tbeje is a trend 
towards more glands expressing lactoferrin (luring the secretory 
phase. Because oi the large variance, tbc trend is not statistically 
significant. As expected, the polymorphonuclear leultocylcs in the 
endometrium also stain intensely for lactofemn protein, which is 
stored in their secondary granules (Fig. 3> These results demonstrate, 
for the first time, ibal lactoferrin protein is expressed in tbe human 
endomeuiuro predominately by polymorphonuclear leucocytes and 
epithelial cells of glands located deep in tbe zona basalis. 

Proliferative Endometrial Disorders: Jlyperplasios and Adeno- 
carcinomas, kriniunohisiochemisiry reveals thai tbe expression of 
iaaoUsns prrJrin i? incrr^vrd in 66.6% (8 of 15) of ihr. mdornelrial 
adenocaicinomas examined. In one-half of these cases (4 of 12), 
laciofenin is intensely expressed by malignant epithelial cells 
throughout the entire tumois (Tig. 4). The othct four adenocarcinomas 
demonsrraie increased Mainins for laciofenin in concentrated regions 
of die rutnois. In all eight cases where lactolenin expression is 
elevated, the cells expressing lactofemn have one similarity ^tb 
normal positive glands in thai they demonstrate heterogeneous slav- 
ing of inlets pet sed positive and negative cells. However, the expres- 
sion ol lactoferrin by the malignant cells clearly differs fiom normal 
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Fig. 1. LocaHisbon of UooJenin protein in oorxoal cycling endorottnuro by imroo- 
' nohisiocbtmisDy niing a specific polyclonal antibody. Ottr u»fysi$ rtvepk U»l !acioi- 
tmn protein is prucntin » limited number of gl»nd5 Jocaicd in the zona b»5*lii of (be 
ttidorotoiuro MJt X )a Also, note tb*t lacioftniD is Uttrojrerxous wiifcin positive 
glands, aJls staining for lactofcnin »t inlcjxpeiicd wirb ncg»im-si;»mmg cp>lbcti») 
c*Uj throughout the gi»nd. Two immonohtjtocbcmicsd staining patterns arc noted lot 
Jacioftrrin in norma) uterine epirheiiura. In ont pattern, l»ctoferrin protein is immunolo- 
caliztd primarily over the cytoplasm (£ J), and in the other, the sttining ia wen over the 
nucleus (57); X 40. 



positive gjands in that the lactoferrin is n6t limited lo the basal regions 
of the tumors, many more ceDs are positive, and the relative intensity 
of the staining ovei individuals ceUs is inaeased. Although increased 
lactoferrin expression is associated with malignant transformation, we 
do not find a con elation between lactoferrin protein presence and the 
stage, nuclear grade, Federation Internationale des Gynecologistes et 
Obstetnstts grade, or the depth of myometrial invasion in the 32 
rumors studied (Table 1). In sharp contrast to the common dysregu- 
) a lion of lactoferrin expression found in the malign an t endometrium, 
only 1 of 10 endometrial hyperplasia specimens evaluated contained 
an ina eased number ol cells staining foi lactoferrin. The hyperplastic 
specimen overexpressing lacloferrin was read as complex without 
atypia. 

in Situ and Northt-rn Analyses of Lactoferrin mRNA 
Expression in Normal and Malignant Endometrium 

To further our ur>de; standing of the location cf lactoferrin protein 
synthesis in the human endometrium, we examined lactoferrin mRNA 
expression by in shv and Northern hybridization m'mg specific 33 S- 
Jabeled probes for human lactoferrin. No detectable RNA hybridiza- 
tion is observed in the normal endometrium by in jmu bybridi2aiion f 
even in the presence of iromunodelectable protein, usin» equivalent 
hybrid »2 itt ion conditions and development times a5 used for the ade- 
nocarcinomas. Continent with the lesurts obtained by in siiu hybrid- 
ization, long exposuit timei- weie. lequired to demonstrate lactoferrin 
mRNA in normal endometrium by Noilhern blot using poly(A"*)- 
mRNA. This indicates ibai laaoftnin rnRNA is present in normal 

i no 
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tissue but in veiy low levels, consistent with the very limited pattern 
of protein expression in normal endometrium- Equivalent RNA load- 
ing and quality for each specimen was demonstrated by elhSdimrj 
bromide staining of the RNA gels (data not shown) and by probing for 
the housekeeping gene g|yceraldebyde-3-phosph3te dehydrogenase 
that does not fluctuate significantly with the metabolic state of the 
(issue. A representative Northern blot is shown in fig. 5. In situ 
hybridization with several ade^oc^cinomas reveals that there is a 
direct correlation between the localization of lactoferrin mRNA and 
the immunos taming of expressed lactoferrin protein (Fig. 4> Lactof- 
errin mRNA is not associated with polyroorphonuclcai leukocytes by 
in situ hybridization in either normal or malignant tissue. - 

Western Blot Analysis 

To confirm the specificity of the antisera that we used foT unrou- 
howstocneroistry, we performed Weston blot aoa/ysrs on proteins 
extracted from both normal and malignant endometrium which were 
separated by 85% SDS-polyacrylaroide gel electrophoresis (Fig. 6>. 
Irorounoblotting identified a single broad protein band with a molec- 
ular weight between 70,000-80,000 in both normal and neoplastic 
endometrial tissue homogenates, consistent with the reported molec- 
ular weight of human lactoferrin. Supporting the irim3unocytod)emi- 
cal analysis, a representative irarounoblo! clearly demonstrates; that 
tbe proportion of protein that is lactoferrin is markedly increased in 
ibe adenocarcinomas* in comparison to the normal endometrium. The 
molecular weight of Jactoferrio in the adenocarcinomas appears to 
have a slightly higher molecular weight thai the predominant form in 
normal tissue. 
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Fig. 7. Pciccnuse of rndorotjnal rtamh upr cuing UaoJtrria pi ot tin by imDiuno- 
hurochemialr*. Sifniftondy nooie flindl sic positive in iht rrgion of tbe 2002 bawt» 
itian in ine ion* rurtrtion»lis of tbt e ndomciinmi {P * 0.001). Zvm bssalii, B; 
Junciiomli*. R Ahhoegh tnrtr is a 11 cod toward* Diwt of iht baislis plandi ^rxpitV-WS 
(^.trfenin in in* sf at lory ph»« (rifhi 9 t omp?md viih th* Uh P), ihu vj! o& 
«,i>usiicyliv iienilitant. 
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Fig. 3. PoIyroorphomjcJct; Jcufcocytcs (orrtn»-j) art scattered thiougbftui iht tndorot- 
niuni and mud intensely lot bctofcrTin. L»aofe/rin it * known component of ibe 
irrondiry £j»nuks in po)yrooFpbonuclt»i leukocyte*. The preienct af a segmented 
niirlcus «r»d Ucioftma piotcin » »n txcrllejji method for identifying ibis group of 
bflsmmstoij cells. A, x 20. X BO. 



Correlation of Lactoferrin Expression with the Expression of 
PCNA, Ki-67, HER-2/»ru, ER, and PR 

In an attempt to characterize the phenotypc of endometrial cells 
which express lactoferrin, we performed imrounohisiocberorsbry on 
stria! sections for ihc Ki-67 antigen, PCNA, HER-2/nrw, Jactofenin, 
ER, and PR- In normal tissue, Ki-67 and PCNA expression are cell 
cycle-specific maj leers or* eel) pi olifc ration (16 r 17). Upon analysis of 
normal cycling endometrium, do relationship between la a of en b pro- 
tein expression and ER, PR, or Ki-67 .expression was observed. 
Similarly, b roosl of the adenocarcinomas evaluated, no relationship 
was noted between lactoferrin and PCNA pjotein expression. How- 
ever, in one adenocarcinoma (Fig. 7), there was a clear inverse 
i elation* hip .seen between jaciofenb and PCNA localization, which 
was present throughout the entire tumor. Most dramatic, however, was 
* suiting inverse con elation seen between lactoferrin and PR expres- 
sion in 8 of 8 PR- positive uterine adenocarcinomas (Erg. 7). Two 
tumors negative for PR also did not express Jact of errb Although an 
inverse correlation was also suggested with HER-Z'ne» and PR, (he 
inverse collection was moic precise with Jacioferrin in these tumors. 

DISCISSION 

In (he mouse uterus, lacrofcirin is an cstiopen-induced ulerine 
'■tcietory protein (hat is present throughout the rpithelium (7), and it 



is expressed concomitantly with epithelial eel) proliferation. In con- 
trast to Iactofejib's 'ubiquitous expression in the estrogenized mouse 
uterine epimelium, laclofenin protein is limited to glandular epithelial 
cells in the basal regions of normal human endometrium and usually 
to glands that were directly adjacent to the myometrium (t^ the 
deepest glands of the . zona basalis). This regional localization of 
lactoferrin expression is not surprising in that other bjocbejnical 
parameters have been reported to show site specificity b primate 
endometrium. These parameters include the proliferative index and 
the expression of the secretory component of IgA {18). Similar lo our 
observations in the endometrium, lactoferrin is also expressed region- 
ally in the mammary gland. In bovine breast tissue, lactoferrin is 
localized primarily to the basal alveolar cells (39); whereas in human 
breast tissue, the ductal epithelium appears to be the primary source of 
secreted laclofenin during lactation (20). 

Examining the endometrium on different days of the menstrua] 
cycle demonstrates a trend towards increased lactoferrin expression 
during the luteal phase. Although this data is not statistically signif- 
icant the cyclic variation may be biologically relevant Kim et at (23) 
recently repoited thai the basal endometrial epithelial cells are unique 
because they proliferate during the postovulatory luteal phase. Inter- 
estingly, in the mouse uterus, there is a direct correlation between 
lactoferrin expression and epithelial cell proliferation (6). Therefore, 
lactoferrin may have a similar role in the human and mouse endome- 
trium. If laclofenin expression is- cyclic, the ERE in the 5'-flankbg 
promotei region of the human lactoferrin gene may be activated 
during the luteal phase (30, 13). 

Anothci similarity between human and mouse uterine lactoferrin 
expression is the observation of two iminunobisiochernical stain bg 
■ patterns. In one pattern, the antisera binds primarily over the cyto- 
plasm, and b the other, the nucleus is the primary site of localization. 
Although this could represent a fixation artifact, we have now ob- 
served Oris pattern in two species and undeT different fixation condi- 
tions. It has been demonstrated that signaling peptides, i.e.. platelet- 
derived growth factor (22), lnl-2 (23), and probasin (24), can be 
selectively direcied bto the nucleus, cytoplasm, or secreted, ft is 
believed that this differentia) processing may allow pioicjns to have 
btracrbe, autocrine, and paracrine roles, dependbg on the physio- 
logical stale of the cell. The two localization patterns observed suggest 
thai laciofcrrio might have signalbg sequences that direct the final 
destination of the mature peptide. 

In normal endometrium, laclofenin mRNA is present but b very 
low levels. Prolonged exposure times are needed to visualize the 
mRNA band with Northetn analysis. Although with in siw hybrid-, 
ization the lactoferrin mRNA signal is easily seen in adenocarcino- 
mas, we failed to localize the laaofeub mRNA in normal endome- 
trium usmg an equivalent exposure time. These low levels of mRNA 
b normal endometrium suggest that the synthesis and degradation of 
lactoferrin mRNA is more tightly regulated in normal tissue than in 
tire adenocarcinomas. Because of the low levels of message in normal 
tissue, wc are unable at this time lo definitively conclude that laclo- 
fenin mRNA is synthesized by tire same epithelial cells which express 
the protein. 

In eodomtUial adenocarcinomas, malignant transformation of the 
endometrium is associated with the up-regulation of lactolenin 
mRNA and protein biosynthesis. The up- regulation at the RNA level 
is demonstrated by an increase in sleady-srate RNA levels using both 
in siw hybridization and Northern analysis techniques. In these can- 
cers, wc also observe an increase in ibe number of lacioferrin-positivc 
cells, which expjess both the protein and mRNA. In this study, 8 of 
the 12 adenocarcinomas evaluated ovcrcxpress lactoferrin, compared 
with only 1 of 10 hyperplastic specimens. The form of lactoferrin 
•protein extracted horn endometrial adenocarcinomas appears to have 
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Fig. 4. Colocaliurion of lactofenin protein panels) and 
mRNA (right poneb) in a normal pioliferative endometrium wmJ 
endometrial »dtDOCardwm»i by performing immuDohjsttxhcjBt*- 
try and m jiiu RNA hybridization on serial sections. Dm) analysis 
of protein and mRNA expression itvcaU that glands in nonnaJ 
endometrium do dot have detectable mRNA, as measured by m sint 
hybridization (tty pontic X 40), whereas analysis of the adtoo- 
carcinomas ricsdy demonstrates a d»ect correlation bcrwtcD pro- 
uin bnd UNA expression loi Jacxoftrrin (*im2c£»* and Jxxkw potuls^ 
X )0t Kote tfial bctofenin protein and mRNA is distributed, ia t 
hcltrogcBtoos pattern in the epithelial cells of (be adcoocajcinomas. 
As is shown in Fig. 1, > heterogeneous si*iD»ng pattern for taetof- 
cmn protein is also, seen frequently Id ponnal endometrium. 




a slightly higher molecular weigh! than ihc protein present in nor id a) 
tissue. This could be due lo alterations in the processing of the 
lactoferrin mRNA, protein, or glycosylation by. the malignant cells. 
Alternatively, there could be minor differences between lactoferrin 
protein which is present in neutrophils and the form synthesized by 
uterine epithelial cells. "We suggest two hypotheses to explain lactof- 
enin ovejexpressibn in endometrial adenocarcinomas. In the first 
hypothesis, lactoferrin biosynthesis is deregulated by the same pro- 
cesses that lead to the malignant transformation of endometrium. If 
this hypothesis, is true then lactoferrin may be a useful majker for 
endometrial adenocarcinoma investigation, and fuitber research is 
needed to del ermine whether lactofenin plays a contributing role in 
the malignant transformation. A second hypothesis is that laaoferrin- 
positive human endometrial adenocarcinomas evolve from tbe clonal 
expansion of cells residing in the regenerative 2one (zona basal is) of 
normal endometrium. It is interesting to speculate that lactoferrin 
expression in endometrial cancer may be linked to estrogen action in 



some way, since proliferative disorders of human endometrium are 
linked to chronic estrogen exposure over sever a) years and sequencing 
data suggests that the promoter for the human bctoferrin gtne does 
contain an ERE. 

Although the function of lactoferrin is unknown, a variety of 
biological roles have been proposed foi lacioferrin which could link 
this protein to a role in cancer, including the regulation of DNA 
synthesis (25-29), modulation of the immune response (25, 30, 33), 
and iron transport (32). Some forms of lactofenin are reported lo have 
RNase activity (33, 34). Secreted RNases arc involved in develop- 
ment; reproductive Junction, neoplasia, angiogCDesis, and immune 
suppression. (35, 36) If angiogenesis and immunosuppression are 
components of lactofenin RNase activity, these properties could pro- 
mote tumor gjowth. 

In the endometrial adenocarcinomas, we observed a heterogeneous 
expiession pattern foi lactofenin, PCNA, Her-2/«ev, ER, a»d PR. 
With regard to prognosis, patient survival is reportedly worse when 



Tabic J Corrrlorion c$ loctcfernn otut J*H typrexsion in human rntiomciriot odrnot crcinomoi 
An invnx condition indk_2»t5 thai lanoitrrin aixJ pjof cstcront ^cic not cjpjiascd in the same rrcions of the id»m>i *>y immuDohisiocHtnoical rnaiysSs. PR w»s deiecrcd '•-nb me 
anriscra KD6S, »nd tecloftmn wis dried td by a specific polyclonal aniiscra. 
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Fig. 5. Northern analysis of lactofmin mRNA c*p>cssjod confirms ihsl tndom-elriaJ 
adenocarcinomas (tones 3 and V) significantly owocprcss the 2_Mtilobase ttansaipt of 
huroaa lactofenin in comprise* to normal endometrium (I*m« 7 and .?). Tbe Northern 
hybridization data snpports th* in «n> RNA rcsutls and wwfinns thai Iftcttrfwrin KNA 
expression » djsit^ubrrd id uterine adenocarcinomas. Normal uitrinr tissues appear to 
contain, low siody^tttt RNA levels of bctoJerrin, icfkcling a controlled partem of 
protein expression. Eqnivalenl W4A loading and quality lor each spcomtn was demoo- 
straied by elhidiom bromide staining o/ Hit RNA gtls (daw not shown) and by probing for 
B housekeeping gent (gjyceraldchydc-^pbospbalt dehydrogenase) thai doe* not fluctuate 
sigDifiontly with tbe inciabolic suit of ibt tissue. 
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endometrial adenocarcinomas Jose sex steroid receptors (37, 38), have 
a higher proliferative index (39), and demonstrate DNA aneuploidy. 
During the rumor progression of endometrial adenocajonoroas, il 
appears that the loss of steroid hormone rtctptors occurs earlier than 
either the increase in proliferation rate or the development of DNA 
aneuploidy (40). In our study, we note a striking inverse correlation 
between the expression of lactoferrin and PR in the endoiDetrial 
adenocarcinomas. An inverse relationship also has been described for 
HER-2//wh and PR in endometrial adenocarcinomas that correlates 
with patient prognosis. Furthermore, in cancers of the human endo- 
metrium, ovary and breast Ktx-2/neu expression has been associated 
with advanced disease and poor survival (41-43). HER-2/nei* is an 
oncogene that shares sequence homology with the epidermal growth 
factor receptor and is speculated to contribute. to aberrant growth. Of 
note is that lacloferrin biosynthesis in the mouse uterus is associated 
with the expression of the epidermal growth facto?, like HER~2//teu, 
the epidermal growth factor receptor is also frequently overexpressed 
in PR-negative cells of endometrial adenocarcinomas (44). The am- 
plification of growth factor receptor expression in PR-negative endo- 
metrial adeDOcarcinoroas may be associated with the acquisition of 
growth autonomy and hormone independence, which may contribute 
to the poorer prognosis of PR-negative endometrial carcinomas (45). 
Some endometrial adenocarcinomas, including recurrent tumors, can 
be treated successfully with progesterone therapy (46> 47). Although 
the significance of the inverse relationship between lacloferrin and PR 
expression is not known, we speculate that the PR-negative cells do 
not undergo the normal growth inhibition and secretory differentiation 
normally associated with progesterone action. 

A survey of human tissues reveals that lacloferrin is expressed by 
most normal mammalian exocrine glands and may be a prognostic 
marker in tumors (20). Lacloferrin is found in normal ductule breast 
epithelium and in primary breasi carcinomas. In breast tumors, there 
is an inverse correlation between lacloferrin and ERs (20). Notably, 
jactofenin expression in breast cancer may fall into the same category 
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Fig- 6- Wcstins b)o> analysis using an antiserum 
sptci6c Jot human lactoftrrin was performed on 
pro lei a* caliacttd ho id normal and mil>gn2ftl cd- 
domciriuin and separated by SDS-polyacrylamidc 
, ft! electrophoresis. A broad proitin band vUb a 
molccujai v-cight between 70,000 and 30.000 b 
detected in both normal and oroplaslic endometrial 
■ tiisuc "horoOEtnairsj consistent with the rtpontd 
moitculai weijjbt ol humao lacloferrin MX ^be 
roost significant observation ftom the bnmunobloi- 
bnf srudics is thai lactolrrrin prrtcnv is markedly 
citvatr d in i be adenocarcinomas, in cornparisoo to 
norro»l endometrium, which supports' tbc irnimino- 
rytocbccjica) analyii; thsl deroonslraltj * jreatrr 
number of cells positive Jew lacioienin protein in 
the uterine mmors. S. ibr iclatjvc imoont of protein 
toaded in oth lane by Coomassie bfut Maining. 



MW 

markers 



97.4 
Lacloferrin - 



C\J 










CO 


E 


- E 


o 


o 


c 


e 


o 


"o 






ro 


m 


O 


o 


O 


o 


c 


c 


0> 


CD 




O 


< 


< 



CD 
> 



o 
w 

CD 
CO 



68 




MW 

markers 



97.4 — 



68 



CM 
rtJ 

E 
o 
c 

o 

k— 

nS 
o 
o 
cr 

Q> 
73 
<C 



E 
o 
c 
o 



05 
> 



S 2 
< 



o 

If a> 
Ol CO 




1175 



F>g. 7. ConekiitoD of laoofcmn protti* cxprts- 
swn wilh the FCKA, a marker ol politeratios. and 
PR as measured by imnnjnohrsiocheinisiry per- 
formed on serial stcocms; X 2a tn most cases* no 
nlauonsbip between Uctofcnin and PCKA oprev 
siaa is found *n> tiibo normal as rnalijnam "do- 
rDtlmnu (dirta pol shown> However, in 1 of 12 
•ikJiocaicicomas, » i»vcrse cotrebbon » seen 
berwt« bctofenin (4) and PCtfA (B) localiza- 
tion. -Hm smkbg pattern w ccjosisttM too»&b- 
out the entire tumor, svEgtstn> S P 0 "*"^ ? 
crll ode reflation of bctoftnic expression >» 
Ihis adtoocaidjjoroau The boacmpomls e » h2 » l « 
inverse coxiclabofl between bctofmift <0^™ 
exp»tss*» (f>X was seca m »» eight endo- 
metrial adenotaranomas **»ch ejprpstd PR. 



LACTOFERSIK DYSKEGULATJO* B* ENDOMETRIAL CANCER 
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as Ibe othci mailceis foi ER-Degative tumors, such as amplication of 
EGF receptor, HER-2,Wm, and transforming growth fadoi a expres- 
sion which ait associated with poor prognosis. Id gastric carcinomas, 
laao/errin expression is associated with transformation of specific 
cell types including intestinal -type carcinomas, adenomas, and incom- 
plete intestinal metaplasias (28). Although this is complete specula- 
tion at ibis rime, perhaps iactofeirin overexpression in tbc various 
malignancies may complement ihc actions of the growth factor path- 
way molecules and contribute to the autonomous giowth of these 
himois. 

In conclusion, om studies revea) that lactofenin is. associated 
with a unique population of epithelial cells in ibe zona basalis and 
tbal Iactofeirin over expression may be associated with malignant 
information of the human endometrium. Further studies are 
needed to elucidate the role of lactofenio jd normal and patholog- 
ical endometrial physiology. 
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Cell proliferation id human soft tissue tumors correlates with platelet- 
derived growth factor B chain expression: an immuuohistochemical and iu 
situ hybridization study. 

Wang J, Coltrera MP , Gown AM . 

Department of Pathology, University of Washington, Seattle 98 195. 

The authors tested the hypothesis that the B chain of the platelet-derived growth factor 
(PDGF), a known connective tissue mitogen and growth factor, could be expressed by 
human soft tissue tumors, and that its expression could play a role in the control of cell 
proliferation in these rumors. Using a set of 56 soft tissue tumors, including benign 
tumors and all three grades of sarcomas, PDGF-B chain protein was localized using 
unmunohistochemistry and PDGF-B mRNA was localized using in situ hybridization 
The hypothesis that PDGF-B expression was related to cell proliferation was tested by 
simultaneously demonstrating the expression of the proliferating cell nuclear antigen in 
sequential tissue sections of the same tumors. Sixty and 82% of tumors had demonstrable 
PDGF-B mRNA and protein, respectively, with a strong correlation between their 
degrees of expression (P = 0.0001). Among the sarcomas, a strong correlation between 
PDGF-B expression and increasing malignant tumor grade (P = 0.006), and between 
PDGF-B expression and increasing proliferating cell nuclear antigen index (P = 0.01) 
was found. All tumors were also demonstrated to express the beta receptor of PDGF via 
unmunohistochemistry. These studies suggest that PDGF-B expression may be an 
important mediator of cell proliferation control, via an autocrine mechanism, in human 
soft tissue tumors and may correlate with clinical outcome in the sarcomas. 
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Expression of cadherins and catenins in paired tumor and non neoplastic 
primary prostate cultures and corresponding prostatectomy specimens. 

WangJ, Knll p t Torbenson M, Wang Q, Bisceglia M, Stoner j, Thomas' A 
DeFlavia P . Dhir R Becich M J . * 

Department of Pathology, University of Pittsburgh Medical Center, PA, USA. . 

Cadherins are a family of transmembrane proteins that play a crucial role in cell 
differentiation, cell migration, and intercellular adhesion. Cadherins are associated with 
catenms through their highly conserved cytoplasmic domain. Down-regulation of E- 
cadhenn protein has been shown in various human cancers. This study examined the 
expression of cadherins and associated catenins at the mRNA level. Paired tumor and 
nonneoplastic primary prostate cultures were obtained from surgical specimens. 
Quantitative multiplex fluorescence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) and quantitative analysis were performed and correlated with 
immunostain results. Six of seven cases of neoplastic cultures showed moderately-to- 
markedly decreased levels of E-cadherin and P-cadherin mRNA. Similar losses of alpha- 
catenin and beta-catenin mRNA were also observed. The results of QMF RT-PCR 
showed good correlation with the results of immunohistochemical studies based on 
corresponding formalin-fixed sections, In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and associated catenins at the mRNA 
and protein level in most of the cases studied. This down-regulation may play an 
important role in the pathogenesis of prostate cancer. 
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Abstract Cadherins arc a family of transmembrane 
proteins that play a crucial role in cell differentiation 
cell migration, and intercellular adhesion. Cadherins are 
associated with caienins through their highly conserved 
cytoplasmic domain. Down-regulaiioh of E-cadherin 
protein has been shown in various human cancers. This 
study examined the expression of cadherins and associ- 
ated catemhs at the mRNA level. Paired tumor and non- 
neoplastic primary proslate cultures were obtained from 
surgical specimens. Quantitative multiplex fluorescence 

Dx^n? n ^ riplaSCpolymerasc chain reac l">* (QMF 
, j and quantitative analysis were performed and 
correlated with immunostain results. Six of seven cases 
or neoplastic cultures showed moderately-io-markedly 
decreased levels of E-cadherin and P-cadberin mRNA 
Similar losses of a-catenin and ^catenin mRNA were 
also observed. The results of QMF RT-PCR showed 
good correlation with the results of immunohistochem- 
ical studies based on corresponding formalin-fixed sec- 
tions. In conclusion, this paper presents a coordinated 
down-regulation in ihe expression of E-cadherin and 
associated catenins at the mRNA and protein level in 
most of the cases studied. This down-regulation may 
play an important role in the pathogenesis of prostate 
cancer. 
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Prostate cancer is the most common malignant tumor 
and the second leading, cause of cancer death in men. 
J he clinical course of patients with prostate cancer 
varies widely, and different factors contribute to this 
marked clinical variability, including genetic back- 
ground, iiormonal environment, and the invasive 
potential of the tumor. Invasion and metastasis are the 
hallmarks of malignancy and have been closely linked to 
alterations in cell-to-cell adhesion, cell migration and 
interactions with extracellular matrix components [22). ' 

. Cadherins are a family of transmembrane glycopro- 
teins responsible for maintaining the integrity of tissue 
and are involved in cell differentiation, cell migration 
and intercellular adhesion through a calcium-dependent 
mechanism characterized by homotypic adhesion [3£- 
37]. Their highly conserved cytoplasmic domains asso- 
ciate with catenins, a group of intracellular proteins that 
mediate contact between the cadherins and the micro- 
filaments of the cytoskelelon. Each cadherin subclass 
shows a unique tissue distribution: E-cadherin is pre- 
dominantly expressed in epithelial cells and P-cadherin is 
restricted to decidua tissue and the basal or lower layers 
of stratified epithelium (30J. 

The accumulating evidence suggests a decrease or loss 
of function in E-cadherin and P-cadherin in several 
human carcinomas [3, 9, 23, 32). Loss of heterozygosity 
(LOH) at chromosome 16 in the location of the E-cad- 
henn gene is present in a high percentage of prostate 
cancers [6, 21, 26. 28]. Decreased expression of E-cadherin 
is seen in various human malignant tumor cell lines, and 
(he level of decrease correlates with the invasive potential 
of the tumor cell lines fl, 9, 12, 24, 40, 42). In addition, 
many, but not all, immunohistochemical studies using 
rormaJm-fixed, paraffin-embedded tissue have shown that 
the E-cadherin prolein is decreased in prostate cancer and 
the decrease is correlated with tumor grade |4, 8, 13, 27, 
38). Other studies have shown decreased-to^absent 
P-cadhenn levels, but variable E-cadherin levels (33) 
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a- and 0-catenins bind the cytoplasmic domain of 
E-cadherin and link it to thecytoskeleton 1)6,3)]. Down- 
regulation of expression and deletion of a -catenin genes 
were identified in several human cancer cell lines 119, 24, 
Immunohislocheraical studies showed decreased 
fl^catenin staining, which correlated well with the loss of 
E : cadherin staining and patient survival [25, 29, 39J. In 
addition, decreased ^catenin protein expression appears 
to be associated with malignant transformation of epi- 
thelial tissue [34]. These results suggest that cadherins 
and catenins may function as tumor invasion-suppressor 
genes; However, most of these results were obtained in 
studies using transformed tumor cell lines and formalin- 
fixed, paraffin-embedded, tissue. To our knowledge, 
no studies to dale have examined the co-expression of 
cadherins and catenins at the mRNA level using material 
derived from surgical specimens. 

Recently, we have developed the methodology to 
cultivate primary epithelial cells under defined condi- 
tions from surgical prostatectomy specimens [20]. Areas 
of both carcinoma and noh-neop!astic tissue are identi- 
fied grossly, verified histologically, and then cultured 
separately, resulting in paired primary cultures of both 
non-neoplastic and neoplastic epithelium from the same 
patient. The non-neoplastic tissue cultures serve as an 
important control of any person-to-person variability in 
the expression of the genes of interest. 

In this study, we investigated the co-expression 
of cadherin and catenin mRNA, from multiple paired 
primary prostate cultures derived from surgical prosta- 
tectomy specimens using quantitative multiplex fluores- 
cence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) [41, 44]. Immunohistochemistry stud- 
ies of cadherins and catenins were performed on the 
corresponding paraffin-embedded prostate tissue. 

. Materials and methods 

Tissue specimen and primary prostate cultures 

We studied patients with adenocarcinoma of the prostate who 
underwent prostatectomy at the University of Pittsburgh Medical 
Center during the time interval January 1996-January 1998. No 
patient had received treatment prior lo surgery. Fresh prostatec- 
tomy specimens were sectioned and grossly examined, representa- 
tive samples of neoplastic and non-neoplastic tissue were collected, 
and the diagnoses were confirmed by examination of hematoxylin 
and eosin (H&E) -stained sections. Epithelial cells from tumor and 



non-neoplastic areas of the specimen were isolated and tta cul- 
tured with a chemically defined mediurf (CDM) without addition 
of growth factors as previously described. Tissue TragmaiU were 
cut into small pieces and underwent a series of collagenous diges- 
tions Following each digestion, the cells were pelleted by cenU> 
fugalion at 1,000 rpm for 4 MIN [201 To selectively promote 
epithelial cell growth, the pellets were resuspended.and maintained 
in serum-free CDM supplemented with epithelial growth factor 
(EGF) [20). The culture's morphology was examined dairy, and 
epithelial cells were allowed to grow until confluence was ireached, 
between days 7 and 10. 

Isolation of total RNA and synthesis of cDNA 

Total RNA was extracted from the cultured primary prostate 
epithelial cells at first passage according lo the Trizol sorouon 
(Gibco BRL, RockviHe, MD) modified method of Chomczynski 
and Sacchi [7J. The RNA was then quantitated spectropliotomet- 
rically. Two micrograms of total RNA were used for 6nt strand 
cDNA synthesis using oligo-dT primers and MMLV reverse tran- 
scriptase (Gibco BRL, Rockville, MD). 

PCR primers and quantitative multiplex fluorescence PCR 

PCR primers for hepalocyte growth factor (HGF) and c-royc 
were synthesized as previously described |I7). PCR primers for 
E-cadherin, P-cadherin, o-catcnin, ^-catenin, and 0- act in cDNA 
were designed according to cDNA sequences provided by. the 
GeneBank (WVV^ncbi.nlm.govVgenebank/queTy). The primers 
were selected to amplify 150-250 bp target genes and the PCR 
products from each target gene were designed to have a different 
si2e (Table 1)- The reverse primers were synthesized with fluores- 
cein molecules covalentry attached to the 5' end (BRL* RockvDle, 
MD). Twenty-five microliters of PCR reactions for QMF-PCR 
contained primers (20 uM each), cDNA corresponding lo 50 ng or 
total RNA, dNTP^ and reaction buffer. The reactions were am- 
plified for 21 cycles at 94 *C for I roin, 57 *C for 2 min; and 72 *C 
for 2 min. Five microliters of QMF-PCR reactions were mixed with 
an equal volume of sequencing gel loading buffer, denatured, and 
aliquois were clectrophorcsed on an A Bl 373A automated sequencer 
(Applied Biosyslems. Foster Gty, CA) using a matrix specific Tor 
fluorescein (Fig. IA). 

Automated sequencer gels were run for 6 b at 30 W using 
Genescan software (ABJ. Foster City, CA). Lane assignments and 
areas of the peaks corresponding to fluorescent peaks were a^'EJ" 1 
and quantified by the Genescan software using Pholomultipber 
lube (PMT) voltages (Fig. IB). All experiments were done in 
triplicate and the results presented as means and standard devia- 
tions (SD). 



Antibodies and immunohistochemistry 

Five-micron sections were obtained from formalin -fixed, P^ffin- 
cmbedded tissue blocks. They were deparaffinized and ?>yd"icd 
with graduated clhanols. Slides were then microwaved tn M M 



TkMc I Summary of PCR 
primer sequences and PCR 
products 



Primer 


Sequence 


Target gene 


Size of PCR product (bp) 


1 


Forward cccacactglgcccaiciacg 


p*-Actin 


170 


2 


Reverse gellctccttaaiglcacgc 


E-cadherin 


190 


3 


Forward caaagtgggcacagatggtgtg 


4 


Rcvcise clgcttggaltccagaaacgg 


P-cadherin 


220 


5 


Forward gcaagagccagctctgttiagc 


6 


Reverse acltgagclgatlcagtctclgg 


ct-Catenin 


182 


7 


Forward gatggacaactatgagccagg 


8 


Reverse talaccaggcgggaagcalcg 


p Catenin 


218 


9 


Foiward rtctggigccaciaccacagc 


10 


Reverse Igcalgcectcaiclaatgtc 
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F>& 1 A Gcnescan image of 
QMF RT-PCR of 6ve paired 
non- neoplastic and neoplastic 
primary prostate cultures. 
cDNA corresponding to 50 ng 
of lota) RNA was subjected to 
21 cycles of PCR, and all 
reactions were done in tripli- 
cate. RNA of both E-cadheri» 
and Podherb showed modcr- " 
ale-to-marked reduction in pri- 
mary neoplastic cultures in four 
. of Bve cases (Cases I, 2, 4, and 
5). B Representative automated 
sequencer traces (ABJ 373A) 
from 21 cycles of QMF 
RT-PCR using cDNA from a 
paired nonneoplastic (top 
panel) and neoplastic (bottom 
panel) primary prostate cultures 
(Case 2X and copy number of 
PCR products (ar-actin, 
E^adherin, and P-cadhexin) 
was calculated using peak areas. 
Neoplastic culture showed 
markedly decreased expression 
of both E-cadhcrin and 
P-cadberin 



220 bp 



mho 1 



170 hp 



<:> d f J*J a u v « " -v» i- .. « t ; , 



beta- Actio 

(ITObp) 



E-cadherin 

(I90bp) 



noo-ccoplastic 



P-cadhtrin 

(MM?) 




neoplastic 



^mlL 1 " 1 ^:.^", 6 ^ f °' aD,i « en ,e " itv »' 'vidin-biotin 
compk, method fo, .mmur»r»stqchemislry was utilised ftcm 

pS.dhtrin^™ 1 <Bu ' li °P"«. USA). The E-cadherin, 

W^fi , ."^ a " d ^""'n "ntibodies. all mouse mono- 

Evaluation of immunostaining 

The peicentage of positive cells, intensity of lhe staining, and cel- 
hjb. local )M .,„„ of the staining we,e examined by two indent 
observers ustng normal pr os,a,e epithelium from the sanT^" 

aTir." C °""°! ^ in,Cnsil > of ,he was graded 

» suong. moderate, weak, and negative. The staining pattern or 

«ml U s^i:: n s COmPa ' Cd Wi ' b °' *»**Xr, m the' 

Rwults 

Expression of E-cadherin and P-cadherin 

Since many prostate cancers are known lo be histolog- 
ically heterogeneous, adjacent H&E-stained sections 
ol the tissue fragments sent lo culture were reviewed 



to assess tissue homogeneity and to rule out the presence 
of olher diseases. The paired primary cultures for this 
study were selected based on the following histologic 
features: (I) non neoplastic tissue showed no cancerous 
foci or high-grade prostatic intraepithelial neoplasia 
(PIN); (2) neoplastic sections contained less than 5% 
non-cancerous epithelium. Seven -out of 38 pairs of 
cultures met the selection criteria and were included in 
this study. Histologically, all seven cases were moder- 
ately differentiated adenocarcinomas, with Cleason 
scores ranging from 5 to 7 (median = 6). In six of these 
cases, adjacent sections of tumor contained no benign 
prostate epithelium, and in one case (specimen 4), there 
was less than 5% non-cancerous epithelium. Prostate 
stromal cells express HGF, whereas the epithelial cells 
express c-met, the receptor for HGF [J7, 20]. The cases 
included m this study showed no detectable HGF 
expression after 21 cycles of QMF RT-PCR (data not 
shown). This indicates there was do significant stromal 
cell contamination in the current epithelial cultures. 

QMF-PCR is an accurate method of measuring the 
relative levels of mRNA in small tissue samples [41]. In 



311 



: this sludy, we used this method to quantitate the mRNA 
levels of E-cadherin and P-cadberin, relative to 0-actin. 
AH the RNA samples contained no DMA contamina- 
tion, as shown by the absence of automated signals when 
RT was omitted from the RT-PCR reactions. We ob- 
served the expected linear increases in 0-actin, E-cadh- 
erin, and. P-cadherin signal intensities between cycle 
numbers 18 and 24, with a cDNA input corresponding 
to 50 ng of total RNA. The ratios of /f-aclin to 
E-cadherin and P-cadherin were constant, as was the 
E-cadherin/P-cadherin ratio. These results indicated that 
the target genes were consistently amplified in the reac- 
tions. Twenty-one cycles or PCR were chosen for all 
subsequent experiments. In addition, the raw sequencer 
quantitation of peak areas for jJ-actin using the same 
cDNA input was similar among the paired primary 
prostate cultures (data not shown). 

When compared with the paired non-neoplastic pri- 
mary cultures and normalized with the /J-actin internal 
controls, four of the neoplastic cultures showed marked 
(>85%) reduction of E-cadherin mRNA levels* with 
one case showing no detectable E-cadherin mRNA. Hie 
other three neoplastic cultures showed mild-to-moderate 
reductions (Table 2). 

Six of seven neoplastic cultures showed moderately 
to markedly decreased P-cadherin mRNA levels when 
compared with non-neoplastic cultures. Interestingly, 



the four cases showing marked reduction in levels of 
E-cadbetin mRNA, and also demonstrated significant 
losses of P-cadherin mRNA. The case with no det ectable 
E-cadherin mRNA also demonstrated near total loss 
of P-cadherin mRNA (Case 1). Another case (Case 3), 
with only mjld reduction. in E-cadherin mRNA, showed . 
no significant change in Prcadherin mRNA. Genetic 
variations among the patients were evident in the 
marked differences in the baseline levels of E-cadherin 
and P-cadherin expression seen in the rion- neoplastic 
primary prostate cultures. 



Expression of a-calenin and #-cateniri 

The highly conserved inlracytoplasmic domain of 
the cadherins interacts with cr- and /£catenins, with the 
catenins serving as a link between the cadherins and the 
microfilaments of the cytoskeletbri. Six cases of neo- 
plastic cultures showed rnild-to-moderate reductions in 
or-catenin mRNA levels, ranging from 26 to 62%. These 
cases also demonstrated more severe reductions of 
0-catenin mRNA levels and generally correlated with 
the changes of cadherins in the same specimen (T able 3). 
Interestingly, the case with no significant change of 
P-cadherin mRNA and only a mild loss of E-cadherin 
mRNA also showed no change in mRNA levels of both 



Table 2 E-cadherin and P-cadherin expression in' paired non- neoplastic and neoplasty primary prostate cultures derived from prosta- 
tectomy specimens , „ 



Case 


E-cadherin" 




Percentage 
loss in tumor*" 


P-cadherin* 




Percentage 
loss in tumor' 




Non-neoplastic 


Tumor 


Non-neoplastic 


Tumor 


1 

2 
3 
4 
5 
6 
1 


71.0 ± 2.2 

27.4 ± 6.4 
18.0 ± 0.2 

93.5 ± IZO 
4I.5 d 3.0 

56.7 ± l.l 

53.8 ± 6. 1 


ND C 

3.7 db 0.3 
12.6 ± 1. 9 

5.4 d 0.9 
• 24.4 d 4.0 

8.1 ± 10 
22.2 ± 0.7 


too 

86 
30 
94 
41 
86 
59 


65.7 ± 2.0 

53.4 ± 6.5 

85.8 ± 6.5 
87.8 ± 7.0 

30.5 ± 1.4 
71.8 ± 2.9 
62.7 ± 3.5 


0-8 ± 0.1 
17.8 ± ).3 

86.4 ± 9.5 

25.5 ± 2.5 
18.1 ± Z4 
17. 1 ± 2J2 
19.4 ± 22 


99 
67 
3 
71 
41 
76 
69 



• Level of E-cadherin and P-cadherin are normalized with /J-aclin from the same sample: (E-cadherin or P-cadherin)/actin x 100 
^Percentage loss in tumor primary culture: 1<N - T)/N) x 100% 
c ND. not delectable 

Table 3 a-Catenin and P-calcnin expression in paired non- neoplastic and neoplastic primary prostate cultures derived Trom prosta- 
tectomy specimens . * . .. 



Case 



Non-neoplastic 



Percentage loss 
in tumor b . 



p^catcnin" 



Tumor 



Non-neoplastic 



Tumor 



45.1 ± 7.1 

49.2 d 1.6 

48.0 d 5.9 

61.1 d 2.5 

58.2 d 1.8 . 
46.7 d 4.7 
43.2 d 2.6 



17 3 ± 0:5 

36.2 ± 4 0 
47.8 d 9.0 
40.8 d 2.6 
27.8 d 1.6 
33.6. d 4/7 
26.4 d 1.7 



62 
26 
<l 
33 
52 
28 
39 



19.7 d. 3-6 

21.7 ± 1.5 
40.1 ± 5.3 

36.8 ± 
29.8 A 
31.5 ± 
22.3 d 



18 

2.0 
4.2 
1.5 



ND r 

3.9 ± 
38.9 d 

22.1 d 

15.2 d 

13.5 d 

11.6 d 



0.2 
6.2 
1.9 
II 
1.6 
1.4 



percentage ^ 
loss »n tumor 



100 

82 
3 
40 
49 
57 
48 



"Level of a-caienin and 0-caleoin ate normalized wilh p-ac\m from the same sample: (catenin/actm) x 100 
b Percentage loss in tumor primary culture: 1(N - T)/N) x 100% 
1 ND, not delectable 
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a- and /f-catenins. Moderate interspecimen variation was 
observed in the baseline expression of a-catenin and 
^-catenin mRNA levels in the non-neoplastic cultures. 



Immunohistochemical studies of cadherins and catenins 

In benign prostate tissue, E-cadherin was, in all cases, 
uniformly localized to tbe membranes of luminal glan- 
dular epithelial cells, predominantly at cell-cell junctions 
(Fig. 3A). One case of prostate cancer showed complete 
negative staining for E-cadherin (Fig; 3B)„ and the re- 
maining six cases demonstrated reduced immunostaining 
for E-cadherin, with 25-75% of cancer cells positive 
(Fig. 3C). The cancerous glands generally showed re- 
duced signal intensity and an altered heterogeneous 
staining pattern, which included focal cytoplasmic stain- 
ing and reduced membranous staining (Fig. 3C, Table 4). 

Benign prostate tissue, showed uniform basal cell 
immunoreactivity for P-cadherin, with principally cyto- 
plasmic and focal membranous pattern staining. The 

T»We 4 Irnmunohistochemical expression of cadherins and cate- 
ci"lure PrQSta,CCl0my 5pec:mcD5 corrc5 P° ndin S tD primary prostate 



benign, glandular non-basal epithelial cells and stromal 
cells were negative for P-cadherin (Fig. 3D)i. P-cadherin 
immunoreactivity was completely absent in two cases of 
prostate cancer (Fig. 3E), and the remaining five cases 
showed variable focal positivity, which was predomi- 
nantly cytoplasmic (Fig. 3F). This focal P-cadherin 
immunostaining positivity was confirmed by staining 
multiple sections and by using different monoclonal 
antibodies; In some cases, serial sections also appeared 
to show immunostaining for both E- and P-cadherins 
with the same neoplastic cells. 

In all cases of benign prostate tissue, a- and /f-catenm 
protein expression showed strong homogeneous staining 
of the luminal glandular epithelium and the basal cells. 
In a pattern similar to that of normal Ercadherin, the a- 
and ^-catenins were localized predominately at luminal 
epithelial cell-cell borders (Fig. 2A, C> In all cases of 
prostate cancer, there was a mild-to-moderate reduction 
in staining for both catenins with 50-75% of cells 
positive, and the staining tended to be heterogeneous 
(Fig. 2B, D; Table 4). 



Discussion 



Case 


E-cadherio 


P-cadherin 


" or-catenin 


fi-cztenm 


1 






+ + 


+ + 


2 


+ + 


+ 


+ + 


+ + 


3 


+ + + 


+ + 


+ + + 


+ + + 


4 






+ + 


+ + + 


5 


f + + 




+ + + 


+ -f + 


6 


+ + 


+ 


4 + + 


+ + + 


7 


+ + 


+ 


+ + 


+' + + ' 



"cganvc; t- , positive; -+ 25-5 

50-75% positive; + + + + , >75% positive 



In this study, we observed a coordinated dowh-regula 
lion of the expression in the genes involved in the 
cadherin and catenin mediated ccllr-ccll pathway at the 
mRNA level. The protein levels, as demonstrated by 
the immunohistochernical studies on the corresponding 
tissue sections, were generally consistent with the 
mRNA data as well as with that reported in the litera- 
ture [18, 33, 34, 38, 39]. 

E-cadherin showed the most consistent loss of ex- 
pression at both the mRNA level and the protein level. 



Fig. ZA-D lmmunohistocherm- 
cat staining of o- and /f-caleoin 
in non-neoplastic prostate and 
in prostate adenocarcinoma. 
Original magnification xl!5. 
A, C Normal membranous 
expression of a-calcnin anil 
^-catenin in non-neoplastic 
prostatic epithelium (Case 1). 
B Decreased immunostaining 
for a-catenin in prostate . 
adenocarcinoma (Case \). 
B Decreased and heterogenous 
immunostaining for /7-catemn 
in prostate adenocarcinoma 
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Fig. 3 A Immunohrstochcmical 
staining of E- and P-cadherin in 
nonneoplastic prostate and in 
prostate adenocarcinoma. 
Original magnification xl!5. 
A Normal membranous ex- 
pression of E-cadherin in non- 
ncopraslk prostatic epithelium 
(Case I). B, C Complete 
negative (Case I) and focal 
heterogeneous (Case 3) immu- 
nostaining for E-cadherin in 
prostate adenocarcinoma. 
D Norma) continuous basal 
layer expression of P-cadhcrin 
in a portion of norv neoplastic 
prostatic epithelium (Case 1). 
E, F Complete negative 
(Case )) and mild focal hetero- 
geneous (Case 3) irorounoslain- 
ing for P-cadhcrin in' prostate 
adenocarcinoma 




LOH at chromosome 16q, where both E-cadherin and 
P-cadherin are located, occurs in up to 30% of prostate 
cancers. Four of our cases showed a greater than 50% 
reduction in mRNA, ranging from 87 to 100%, sug- 
gesting that mechanisms in addition to LOH may play a 
role in the reduction of E-cadherin mRNA levels below 
that of the 50% predicted by LOH alone. 

Previous studies have suggested that P-cadherin 
could serve as a specific marker, for basal cell differen- 
tiation and was not expressed in prostate cancer, 
although a recent study has shown focal P-cadherin 
expression in some prostate tumors [J 8). In this study, 
we demonstrated P-cadherin mRNA in six of seven 
neoplastic cultures, although it was significantly reduced 
in all six cases. It is impossible to rule but the possibility 
that some of the P-cadherin mRNA expression may 
have resulted from potential contamination by small 
numbers of nonneoplastic basal cells admixed with the 
neoplastic cells within the culture material. However, we 
favor the interpretation that low levels of P-cadherins 
are expressed in cultured tumor cells, as well as weakly 
in some tumors in vivo owing to disruption of normal 
gene regulation. This interpretation is favored by the 
presence of focal immunostaining for P-cadherin protein 
in histologically neoplastic cells in five of seven cases. 
The histologic selection criteria (requiring minimal to no 
benign prostate glands) should also have minimi2ed 
major contamination. Despite the presence of mRNA in 
primary tumor cultures and focal positive immuno- 



staining, P-cadherin immunostaining may still serve as a 
useful basal cell marker because the staining pattern was - 
distinctly abnormal in the cases where it was focaJly 
present. 

In this study, the levels of ct- and 0-catenin mRNA 
were also lower than normal in six of seven cases, 
though the reductions were relatively Jess than those of 
cadherins, especially for the o-catenins. Catenin. protein 
expression was generally moderate to weak by immu- 
nohistocheroistry and showed a heterogeneous cyto- 
plasmic and weak membranous staining pattern; The 
immunopositivity was generally similar in most cases, 
with 25-75% cells showing positivity; The results of the 
immunohistochemica) studies were generally consistent 
with the mRNA data; although not in all cases (e.g.. 
Case I); perhaps due to tumor heterogeneity. 

In this study, we also observed a coordinated down- 
regulation of E-cadherin and the calenins in most cases; 
this was most observable at the mRNA level. The cad- 
herins are tightly regulated during embryogeriesis and 
appear to serve the need for precise spatio-temporal 
regulation. The promoter sequences of both E-cadhenn 
and P-cadherin have been cloned and functionally ana- 
lyzed (2, 5, 10, 15]. Both promoters have similar regu- 
latory elements, such as GC-rich regions and GC A AT 
boxes. Although the two promoters share similar se- 
quences, in vitro binding studies suggest that the two 
promoters are regulated by different transcriptional 
factors (II). The tissue specificity of these promoters 
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appears to be derived by different combinations of a 
relatively few feelers common lo many types of tissues, 
and does not appear to be derived from transcription 
factors specfic for each type of tissue. In our study, the 
degree of loss of expression of a- and p^atenins at the 
r tV ^ WaSCOrre,atcd wit, » ^«cHons « the levels 
ni Jr"^-" O ? ression - Ahhough the exact mecha- 
nism of th,s coordinated down-regulation is not known 

™. COOr t ,a ' C ? V*"™ »JWil* hypothesis tha 
loss or alteration of some regulatory fectors occurs 

f"?"i P ? Sta,C ! Umor P ro 8'«si'>n. Possible mechanisms 
indude transcr.pt.onal factor alterations or hyper- 
metiylauon or the promoter region |14, 431. LinTed 
mformauon is available on ^IJ^JJ^S- 
uon of the catemns. but it is possible that similar 
mechanisms may play a role. 

exD J rS„1^K ,hiS h ** of the 

IS*? S it* ^ ncs ,nvo,ved >» cadherin-medi- 
atedcell-celiadhesion pathway at the mRNA level using 
pawed neoplasUc ami non-neoplastk primary culture! 

wdjeate (bat (I) there » a marked patienl-lo-palienl 
var.at.on m the normal levels of the cadherins and 
catemns; (2) mRNA level, of E-cadberin as well 2 
caten.ns are s.gmficantly reduced in some prostate can- 
cerpnmary cultures, and the reduction lends to be to the 

regufaung mechanism common to all of these genes; (3) 

r^A° , fr° tC - n ,eV£ ' in SMn = P^tate cancers. This 
c^rdmated down-regulation ofE-cadberin and cater,™ 

tumor pathogenesis and metastasis. 
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Down-regulation of prostate-specific antigen expression by finasteride 
through inhibition of complex formation between androgen receptor and 
steroid receptor-binding consensus in the promoter of the PSA gene in 
LNCaP cells. 

Wang LG , Liu XM Kreis W. Bitdman DR . 

Department of Medicine, New York University, Manhasset 1 1030, USA. 

As a specific competitive inhibitor of Salpha-reductase, an intracellular enzyme that 
converts testosterone to di hydro testosterone, finasteride is being extensively used for the 
treatment of benign prostatic hyperplasia and in experimental settings for prostate cancer. 
In this study, we showed that finasteride markedly inhibited prostate-specific antigen 
(PSA) secretion and expression. The promoter of the PSA gene contains several well- 
known cis-regulatory elements. Among them, steroid receptor-binding consensus (SRBC) 
has been identified as a functional androgen-responsive element Our previous study 
showed that PSA was not only present in conditioned medium of the PSA-positive 
LNCaP cells but was also detectable in small amounts in PSA-negative cell lines, PC-3 
and DU-145 (L. G, Wang et all, Oncol. Rep., 3: 911-917, 1996). A strong correlation 
between binding of nuclear factors to SRBC and the level of PSA present in the 
conditioned medium and cell extracts was found in these three cell lines, whereas no such 
correlation with binding was obtained using Spl oligonucleotide as a probe. Binding of 
LNCaP cell nuclear proteins to SRBC was diminished when the cells were exposed to 25 
microM finasteride, at which concentration 50% of both PSA mRNA and protein were 
inhibited. As a major component of DNA-protein complexes, the level of androgen 
receptor was dramatically decreased in the cells treated with finasteride. Our data indicate 
that inhibition of complex formation between SRBC and nuclear proteins due to the 
remarkable decrease in the level of androgen receptor plays a key role in the down- 
regulation of PS A gene expression by finasteride in LNCaP cells. 
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Expression of calcyclin in human nielanocytic lesious. 
Wcterrnan MA van Muijen GN , Blocniers LIP . Ruiter DJ , 
Department of Biochemistry, University of Nijmegen, The Netherlands. 

When comparing two subsequent stages of melanocyte tumor progression we identified 
calcyclin as a new potential progression marker, the expression of which was correlated 
with metastatic behavior of various human melanoma cell lines in nude mice. In this 
study, we describe a good correlation between RNA and protein levels in the xenograas 
of these cell lines and extended these experiments to a panel of 120 routinely processed 
human melanocyte cutaneous lesions. Northern blot analysis demonstrated that calcyclin 
RNA expression was elevated in melanoma metastases as compared to several types of 
nevocellular nevi. Calcyclin staining using a specific polyclonal antiserum showed a 
more complex pattern. A stronger staining in a higher percentage of positive cells was 
observed in thick primary melanoma (> or= 1.5 mm) as compared to thin primary 
melanoma (< 1 .5 mm). Calcyclin expression was also present in a higher percentage of 
cells showing a stronger staining in melanomas with higher Clark levels (> II) 
corresponding to the vertical growth phase of primary melanomas. Protein expression in 
nevocellular nevi was confined to the dermal part and was highest in the lower parts of 
the dermis. Remarkably, dysplastic nevi (atypical moles), potential precursors of 
melanoma, did not show any expression at all, either in junctional or derma] parts. 
Confinement of the expression to the dermal part of nondysplastic nevi and primary 
melanomas may reflect interactions with the microenvironment of the reticular dermis 
that occurs with vertical growth! 
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Estrogen regulation of the cytochrome P450 3 A subfamily in humans. 

Williams ET, Lcyk M Wrighton SA , Davies PX Loose PS , Shipley GL . Strobel HW . 

Department of Biochemistry, Medical School, University of Texas Health Science Center 
at Houston, 643 1 Fannin, MSB 6.200, Houston, TX 77030, USA. 

This study examines the possible rote of estrogen in regulating the expression of the 
human CYP3A subfamily: CYP3A4, CYP3A5, CYP3A7, and CYP3A43. To accomplish 
this goal, mRNA was quantified from human livers and endometrial samples, and total 
CYP3A protein levels were evaluated by Western immunoblot analysts of the liver 
samples. The human endometrial samples were from premenopausal and postmenopausal 
women. The premenopausal endometrium was either in the proliferative or secretory 
phase, whereas for the postmenopausal endometrium samples, the women had been 
treated with either a placebo or estropipate, an estrogen substitute. After analyses, 
CYP3A4mRNA was shown to have lower hepatic expression in females than in males. 
In the endometrium, CYP3A4 and CYP3 A43 are down-regulated by estrogen, whereas 
CYP3A5 is expressed at higher levels during the secretory phase, CYP3A7 was not 
detected in the endometrium. In addition, the CYP3A subfamily showed increased 
mRNA expression in the liver as age increased. The expression levels of total CYP3A 
protein and total CYP3A mRNA showed good correlation. Despite apparent regulation of 
CYP3 A4 mRNA expression by estrogen, the effects of estrogen may be overshadowed 
by additional regulators of gene expression. 
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Severely decreased MARCKS expression correlates with ras reversion but 
uot with mitogenic responsiveness. 

Wojtaszck PA, Stumno DJ Blackshear PJ . Macara IG 

05To5*' nent ° f Path ° ,0gy * Univeisit y °f Vennont College of Medicine, Burlington 

Phorbol ester-inducible phosphorylation of MARCKS, the *80-kDa' substrate of protein 
kinase C, was undetectable in several phenotypically dominant, non-transformed 
revertants independently derived from the ras-transformed cell line NIH3T3 DT-ras 
Extremely low expression of MARCKS protein accounted for this apparent lack of 
phosphorylation. MARCKS -encoding rriRNA levels were correspondingly decreased 
relative to normal and ras-transformed cells in all four ras revertant cell lines studied: C- 
1 1 and F-2, derived by 5-azacytidine treatment and selection with ouabain; CHP 9CJ, 
derived by ethylmethane sulfonate mutagenesis and selection with cis-hydroxy-L-pro'line- 
and 12 : V3, derived by transfection with the human Krev-1 gene. However, re-expression' 
of MARCKS after transfection of a cloned MARCKS cDNA into the C-U ras revertant 
cells was not sufficient to induce retransformation. In fact, no significant difference in 
sensitivity to mitogenic stimulation by phorbol esters was observed among several cell 
lines expressing widely varying levels of MARCKS. This evidence argues against a 
direct role for MARCKS in mitogenic signaling. However, the strong correlation between 
attenuation of MARCIS expression and phenotypically dominant ras reversion suggests 
that a common negative regulatory mechanism might be responsible for both effects 
presenting a potentially useful strategy for identifying factors involved in transducing the 
ras signal. 
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Abstract 



Background 
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Prostate stem ceif antigen (PSCA) is a recently defined homologue of the 
Thy-l/Ly-6 family of glycosylphosphatidylinositol (GPJ)- anchored cell 
surface antigens. The purpose of the present study was to' examine the 
expression status of PSCA protein and mRNA in clinical specimens of human 
prostate cancer (Pea) and to validate it as a potential molecular target for 
diagnosis and treatment of Pea. 



Materials and Methods 

imrnunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections 
from 20 benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial 
neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was 
semiquantitative^ scored by assessing both the percentage and intensity of 
PSCA- positive staining cells in the specimens. 1 hen compared PSCA 
expression between BPH, PIN and Pea tissues and analysed the correlations 
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of PSCA expression level with pathological grade, clinical stage and 
progression to androgen-independence in Pea. 

Results 

In BPH and low grade PIN, PSCA protein and mRNA staining were weak or . 
negative and less intense and uniform than that seen in HGPIN and Pea. 
There were moderate to strong PSCA protein and mRNA expression in 8 of 
11 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by 
IHC and ISH analyses, with statistical significance compared with BPH 
(20%) and low grade PIN (22.2%) samples (p < 0.05, respectively). The 
expression level of PSCA increased with high Gleason grade, advanced stage 
and progression to androgen-independence (p < 0.05, respectively). In 
addition, IHC and ISH staining showed a high degree of correlation between 
PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA as a new cell surface marker is 
overexpressed by a majority of human Pea. PSCA expression correlates 
positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and 
androgen-independence, and speculatively with prostate carcinogenesis. 
PSCA protein overexpression results from upregulated transcription of PSCA 
mRNA. PSCA may have prognostic utility and may be a promising molecular 
target for diagnosis and treatment of Pea. 



Outline Introduction 



Discussion 



*tr;ict Prostate cancer (Pea) is the second leading cause of cancer-related death in 
American men and is becoming a common cancer increasing in China. 
Despite recently great progress in the diagnosis and management of 
localized disease, there continues to be a need for new diagnostic markers 
that can accurately discriminate between indolent and aggressive variants of 
Pea. There also continues to be a need for the identification and 
characterization of potential new therapeutic targets on Pea cells. Current 
Refcre-nres diagnostic and therapeutic modalities for recurrent and metastatic Pea have 
been limited by a lack of specific target antigens of Pea. 

Although a number of prostate-specific genes have been identified (i.e. 
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2), 
the majority of these are secreted proteins not ideally suited for many 
immunological strategies. So, the identification of new cell surface antigens 
is critical to the development of new diagnostic and therapeutic approaches 
to the management of Pea. 

Reiter RE et al [.1] reported the identification of prostate stem cell antigen 
(PSCA), a cell surface antigen that is predominantly prostate specific. The 
PSCA gene encodes a J 23 amino acid glycoprotein, with 30% homology to 
stem cell antigen 7 (Sea 2). Like Sca-2, PSCA also belongs to a member of 
theThy-j/Ly-6 family and is anchored by a glycosylphosphatidylinositol 
(GPJ) linkage. mRNA in situ hybridization (ISH) localized PSCA expression in 
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normal prostate to the basal cell epithelium, the putative stem cell 
compartment of prostatic epithelium, suggesting that PSCA may be a 
marker of prostate stem/progenitor cells. 

In order to examine the status of PSCA protein and mRNA expression in 
human Pea and validate it as a potential diagnostic and therapeutic target • 
for Pea, we used immunohistochemistry (IHC) and in situ hybridization 
(ISH) simultaneously, and conducted PSCA protein and mRNA expression 
analyses in paraffin-embedded tissue specimens of benign prostatic 
hyperplasia (BPH, n = 20), prostate intraepithelial neoplasm (PIN, n = 20) 
and prostate cancer (Pea, n = 48). Furthermore, we evaluated the possible 
correlation of PSCA expression level with Pea tumorigenesis, grade, stage 
and progression to androgen-independence. 
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Tables 
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Tabfe 1 

Correlation of PSCA 
expression with Gleason 
score 



Tsble 2 

Correlation of PSCA 
expression with clinical 
stage 



Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 
patients of 57-84 years old by prostatectomy, transurethral resection of 
prostate (TURP) or biopsies. The patients were classified as 20 cases of BPH, 
20 cases of PIN, 40 cases of primary Pea, including 9 patients with recurrent 
Pea and a history of androgen ablation therapy (orchiectomy and/or 
hormonal therapy), who were referred to as androgen-independent prostate 
cancers. Eight specimens were harvested from these androgen-independent 
Pea patients prior to androgen ablation treatment- Each tissue sample was 
cut into two parts, one was fixed in 10% formalin for IHC and the other 
treated with 4% paraformaldehyde/0.1 M PBS PH 7-4 in 0.3% DE.PC for 3 h 
for JSH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 pm sections and mounted on the glass slides 
specific for IHC and ISH respectively in the usual fashion. H&E-stained 
section of each Pea was evaluated and assigned a Gleason score by the 
experienced urological pathologist at our institution based on the criteria of 
Gleason score [2]. The Gleason sums are summarized in Table 3 . Clinical 
staging was performed according to Jewett-whitmore-prout staging system, 
as shown in Table 2. In the category of PIN, we graded the specimens into 
two groups, i.e. low grade PIN (grade J - 11) and high grade PIN (HGPJN, 
grade III) on the basts of literatures [3,4]. 

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to 
microwaving in 30 mmol/t citrate buffer, PH 6.0 {Boshide, Wuhan, China) in 
a 900 W oven for 5 rnin to induce epitope retrieval. Slides were allowed to 
cool at room temperature for 30 min. A primary mouse antibody specific to 
human PSCA (Boshide, Wuhan, China) with a 3 :3 00 dilution was applied to 
incubate with the slides at room temperature for 2 h. Labeling was detected 
by sequentially adding biolinylated secondary antibodies and strepavidin- 
peroxidase, and localized using 3,3' diaminobenzidine reaction. Sections 
were then counterslained with hematoxylin. Substitution of the primary 
antibody with phosphate- buffered saline (PBS) served as a negative- 
staining control. 
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mRNA in situ hybridization (ISH) 

Five-pm-thick tissue sections were deparaffinized and dehydrated, then 
digested in pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5°C, 
and further processed for ISH. Digoxigenin-labeled sense and antisense 
human P5CA RNA probes (obtained from Boshide, Wuhan, China) were 
hybridized to the sections at 48°C overnight. The posthybridization wash 
with a high stringency was performed sequentially at 37°C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 15 min and in 0.2 x SSC 
for 30 min. The slides were then incubated to biotinyrated mouse anti- 
digoxigenin antibody at 37.5°C for 1 h followed by washing in 1 x PBS for 
20 min at room temperature, and then to strepavidin-peroxidase at 37.5°C 
for 20 min followed by washing in 1 x PBS for 15 min at room temperature. 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals. 
Sections hybridized with the sense control probes routinely did not show any 
specific hybridization signal above background, All slides were hybridized 
with PBS to substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA immunostaining 
' and mRNA in situ hybridization, the same scoring manners are taken in the 
present study for PSCA protein staining by IHC and PSCA mRNA staining by 
ISH. Each slide was read and scored by two independently experienced 
urological pathologists using Olympus BX-41 light microscopes. The 
evaluation was done in a blinded fashion. For each section, five areas of 
similar grade were analyzed semiquantitative^ for the fraction of cells 
staining. Fifty percent of specimens were randomly chosen and rescored to 
determine the degree of interobserver and intraobserver concordance. There 
was greater than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopically was graded on a 
scale of .O to 3+ with 3 being the highest expression observed (0, no 
stairung; 1 + , mildly intense; 7-v , moderately intense; 3+, severely intense). 
The staining density was quantified as the percentage of cells staining 
positive for PSCA with the primary antibody or hybridization probe, as 
follows: 0 = no staining; 1 = positive staining in <25% of the sample; 2 = 
positive staining in 25%-50% of the sample; 3 = positive staining in >50% 
of the sample. Intensity score (0 to 3 + ) was multiplied by the density score 
(0-3) to give an overall score of 0-9 | .1,5]. In this way, we were able to 
differentiate specimens that may have had focal areas of increased staining 
from those that had diffuse areas of increased staining [6]. The overall score 
for each specimen was then categorically assigned to one of the following 
groups: 0 score, negative expression; 1-2 scores, weak expression; 3-6 
scores, moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and 
Pea tissues were compared using the Chi-square and Student's /-test. 
Univariate associations between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were calculated using 
Fisher's F_xnct Test. For all analyses, p < 0.05 was considered statistically 
significant. 
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Figures 



PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed weakly in individual 
samples of BPH. Some areas of prostate expressed weak levels (composite 
score 1-2), whereas other areas were completely negative (composite score 
0). Four cases (20%) of BPH had moderate expression of PSCA protein and 
mRNA (composite score 4-6) by IHC and ISH. Jn 2/20 (10%) BPH 
specimens, PSCA mRNA expression was moderate (composite score 3-6), 
but P$CA protein expression was weak (composite score 2) in one and 
negative (composite score 0) in the other. PSCA expression was localized to 
the basal and secretory epithelial cells, and prostatic stroma was almost 
negative staining for PSCA protein and mRNA in all cases examined. 

PSCA expression in PIN 



j-sgure .l 

Representatives of PSCA 
3HC and ISH staining in Pea 
"(A. IHC staining, B. ISH 
staining, x200 
magnification) 



In this study, we detected weak or negative expression of PSCA protein and 
mRNA (^2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of 11 (18.2%) 
HGPIN, and moderate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate PSCA mRNA 
expression (6 score) was found weak staining for PSCA protein (2 score) by 
IHC Strong PSCA protein and mRNA expression (9 score) were detected in 
the remaining 3 of 11 (27.3%) HGPIN. There was a statistically significant 
difference of PSCA protein and mRNA expression levels observed between 
HGPIN and BPH (p < 0.05), but no statistical difference reached between 
low grade PIN and BPH (p > 0.05). 



PSCA expression in Pea 



In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and if PSCA expression levels are increased in malignant compared 
with benign glands, Forty-eight paraffin-embedded Pea specimens were 
analysed -by IHC and ISH. It was shown that 19 of 48 (39.6%) Pea samples 
stained very strongly for PSCA protein and mRNA with a score of 9 and 
another 21 (43.8%) specimens displayed moderate staining with scores of 
4-6 (Figure 1). In addition, 4 specimens with moderate to strong PSCA 
mRNA expression (scores of 4-9) had weak protein staining {a score of 2) 
by IHC analyses. Overall, Pea expressed a significantly higher level of PSCA 
protein and mRNA than any other specimen category in this study (p < 
0.05, compared with BPH and PIN respectively). The result demonstrates 
that PSCA protein and mRNA are overexpressed by a majority of human 
Pea. 

Correlation of PSCA expression with Gleason score in Pea 

Using the semi- quantitative scoring method as described in Materials and 
Methods, we compared the expression level of PSCA protein and mRNA with 
Gleason grade of Pc3, as shown in Table .1 . Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well- differentiation, 5-7 scores = 
rnoderate-drfferenttation and 8-10 scores = poor-differentiation {/]. 
Seventy- two percent of Gleason scores 8- JO prostate cancers had very 
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strong staining of PSCA compared to 21% with Gleason scores 5-7 and 17% 
with 2-4 respectively, demonstrating that poorly differentiated Pea had 
significantly stronger expression of PSCA protein and mRNA than 
moderately and well differentiated tumors (p < 0.05). As depicted in Figure 
1, IHC and ISH analyses showed that PSCA protein and mRNA expression in 
several cases of poorly differentiated Pea were particularly prominent, with 
more intense and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we showed, the 
results in Table 2. Seventy- five percent of locally advanced and node 
positive cancers (i.e. C-D stages) expressed statistically high levels of PSCA 
versus 32.5% that were organ confined (i.e. A-B stages) (p < 0.05). The 
data demonstrate that PSCA expression increases significantly with 
advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen-, 
independent progression of Pea 

AN 9 specimens of.androgen-independent prostate cancers stained positive 
for PSCA protein and mRNA. Eight specimens were obtained from patients 
managed prior to androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the strongest staining 
category (score = 9), compared with three out of eight (37.5%) of patients 
with ahdrogen-dependent cancers (p < 0.05). The results demonstrate that 
PSCA expression increases significantly with progression to androgen- 
independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates 
significantly with increasing grade, worsening stage and progression to 
androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the results of PSCA IHC 
staining with mRNA ISH analysis. Positive staining areas and its intensity 
and density scores evaluated by IHC were identical to those seen by JSH in 
79 of 88 (89.8%) specimens (18/20 BPH, 3 9/20 PIN and 42/48 Pea 
respectively). Importantly, 27/27 samples with PSCA mRNA composite 
scores of 0-2, 32/36 samples with scores of 3-6 and 22/24 samples with a 
score of 9 also had PSCA protein expression scores of 0-2, 3-6 and 9 
respectively. However, in 5 samples with PSCA mRNA overall scores of 3-6 
and in 2 with scores of 9 there were less or negative PSCA protein 
expression (i.e. scores of 0-4), suggesting' that this may reflect 
posttranscriptional modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data demonstrate that 
the results of PSCA immunostaining were consistent with those of mRNA 
ISH analysis, showing a high degree of correlation between PSCA protein 
and mRNA expression. 
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PSCA is homologous to a group of cell surface proteins that mark the 
earliest phase of hematopoietic development. PSCA mRNA expression is 
prostate-specific in normal male tissues and is highly up-regulated in both 
androgen- dependent and- independent Pea xenografts (LAPC 4 tumors). We 
hypothesize that PSCA may play a role in Pea tumorigenesis and 
progression, and may serve as a target for Pea diagnosis and treatment. In 
this study, IHC and ISH showed that in general there were weak or absent 
PSCA protein and mRNA expression in BPH.and low grade PIN tissues. 
However, PSCA protein and mRNA are widely expressed in HGPIN, the 
putative precursor of invasive Pea, suggesting that up-regulation of PSCA is 
an early event in prostate carcinogenesis. Recently, Reiter RE et al [1], 
using ISH analysis, reported that 97 of 118 (82%) HGPIN specimens stained 
strongly positive for PSCA mRNA. A very similar finding was seen on mouse 
PSCA (mPSCA) expression in mouse HGPIN tissues by Tran C. P et al [S]. 
These data suggest that PSCA may be a new marker associated with 
transformation of prostate cells and tumorigenesis. 



. We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly 
differentiated, androgen-independent and metastatic cases. Fluorescence- 
activated cell sorting and confocal/ irnmunofluorescent studies 
demonstrated cell surface expression of PSCA protein in Pea cells [9]. Our 
IHC expression analysis of PSCA shows not only cell surface but also 
apparent cytoplasmic staining of PSCA protein in Pea specimens (Figure I). 
One possible explanation for this is that anti-PSCA antibody can recognize 
PSCA peptide precursors that reside in the cytoplasm. Also, it is possible 
that the positive staining that appears in the cytoplasm is actually from the 
overlying cell membrane [5). These data seem to indicate that PSCA is a 
novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression correlates with high 
grade (i.e. poor differentiation), increased tumor stage and progression to 
androgen- independence of Pea. These findings support the original IHC 
analyses by Gu 2 et al (9], who reported that PSCA protein expressed in 
94% of primary Pea and the intensity of PSCA protein expression increased 
with tumor grade, stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al [3 0], in which the 
significant association between high PSCA expression and adverse 
prognostic features such as high Gleason score, seminal vesicle invasion and 
capsular involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recurrence, clinical 
progression or survival of Pea. Hara H et al [ J 1 ] used RT-PCR detection of 
PSA, PSMA and PSCA in 1 ml of peripheral blood to evaluate Pea patients 
with, poor prognosis. The results showed that among 58 PCa patients, each 
PCR indicated the prognostic value in the hierarchy of PSCA> PSA > PSMA RT- 
PCR, and extraprostatic cases with positive PSCA PCR indicated lower 
disease- progression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be- used as a prognostic factor. Dubey P et al 
|].7] repotted that elevated numbers of PSCA t- cells correlate positively 
with the onset and development of prostate carcinoma over a long time 
span in the pi estates of the 1 RAMP and PTL : N -t/- models compared with its 
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normal prostates. Taken together with our present findings, in which PSCA 
is overexpressed from HGPIN to almost frank carcinoma, it is reasonable . 
and possible to use increased PSCA expression level or increased numbers 
of PSCA-positive cells in the prostate samples as a prognostic marker to 
predict the potential onset of this cancer. These data raise the possibility 
that PSCA may have diagnostic utility or clinical prognostic value in human 
Pea. 

The cause of PSCA overexpression in Pea is not known. One possible 
mechanism is that it may result from PSCA gene amplification. In humans, 
PSCA is located on chromosome 8q24.2 [1], which is often amplified in 
metastatic and recurrent Pea and considered to indicate a poor prognosis 
[13-15]. Interestingly, PSCA is in close proximity to the c-myc oncogene, 
which is amplified in >20% of recurrent and metastatic prostate cancers 
[ 1 6, 1 7). Reiter RE et at [ 1 8] reported that PSCA and MYC gene copy 
numbers were co-amplified in 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression is associated with PSCA and MYC 
coamplification in Pea. Gu 2 et al [9] recently reporteted that in 102 
specimens available to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had identically positive areas 
of PSCA protein and mRNA expression. Taken together with our findings, in 
which we detected moderate to strong expression of PSCA protein and 
mRNA in 34 of 40 (85%) Pea specimens examined simultaneously by. 1HC 
and ISH analyses, it is demonstrated that PSCA protein and mRNA 
overexpressed in human Pea, and that the increased protein level of PSCA 
was resulted from the.upregulated transcription of its mRNA. 

At present, the regulation mechanisms of human PSCA expression and its 
biological function are yet to be elucidated. PSCA expression may be 
regulated by multiple factors (13]. Watabe T et al [3 9] reported that 
transcriptional control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be regulated or 
mediated through cell-cell contact and protein kinase C (PKC) [20]. 
Homologues of PSCA have diverse activities, and have themselves been ' 
involved in carcinogenesis. Signalling through SCA-2 has been demonstrated 
to prevent apoptosis in immature thymocytes [2J]. Thy- 3 is involved in T 
cell activation and transducts signals through sre-tike tyrosine kinases [22].. 
Ly-6 genes have been implicated both in tumorigenesis and in cell-cell 
adhesion [23 25]. Cell-cell or ce!l J matrix interaction is critical for local tumor 
growth and spread to distal sites. From its restricted expression in basal 
cells of normal prostate and its homology to SCA-2, PSCA may play a role in 
stem/progenitor cell function, such as self-renewal (i.e. anti-apoptosis) 
and/or proliferation [)]. Taken together with the results in the present 
study, we speculate that PSCA may play a role in tumorigenesis and clinical 
progression of Pea through affecting cell transformation and proliferation. 
From our results, it is also suggested that PSCA as a new cell surface 
antigen may have a number of potential uses in the diagnosis, therapy and 
clinical prognosis of human Pea. PSCA overexpression in prostate biopsies 
could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Similarly, the detection of PSCA- over expressing 
cells in bone marrow or peripheral blood may identify and predict metastatic 
progression better than current assays, which identify only PSA- positive or 
PSMA- positive prostate cells. 
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In summary, we have shown in this study that PSCA protein and mRNA are 
maintained in expression from HGPIN through all stages of Pea in a majority 
of cases, which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell differentiation), 
advanced stage and androgen- independent progression. PSCA protein 
overexpression is due to the upregulation of its mRNA transcription. The 
results suggest that PSCA may be a promising molecular marker for the 
clinical prognosis of human Pea and a valuable target for diagnosis and 
therapy of this tumor. 
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Expression of superoxide dismutases, catalase, and glutathione peroxidase 
in glioma cells. 
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Pour primary antioxidant enzymes were measured in both human and rat glioma cells 
Both manganese-containing superoxide dismutase (MnSOD) and copper-zinc-containing 

ceU hues. MnSOD was generally h.gher in human glioma cells than in rat glioma cells 
and relauvely hxgher than ,n other tumor types. High levels of MnSOD in human glioma 
ed s were due to fee 1^^ g ° ma 

Heterogeneous expression of MnSOD was present in individual glioma cell lines and 
r^ntSn S " b P° pulatio , nS ° r cdIs at different differentiation stages. Less difference 
Zuli ? ' , aS6 ' ° r ^ thioae P eroxid « was found between human and rat glioma 
cells. The human ghoma cell lines showed large differences in sensitivity to the 
glutathione modulating drugs 1,3-bis (2-chloroethyl)-l -nitrosourea (BCNU).and 
buthtonme^lfoximine (BSO). A good correlation was found between sensitivity to 

bSI ? a T? 9f "talase ^ecell lines. Only one cell line was sensitive to 
BSO and this line had low CuZnSOD activity. 

PMID: 10641728 [PubMed - indexed for MEDLINE] 
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[Expression ofhumau teloiuerasc reverse transcriptase in cervix cancer 
and its significance! 

[Article in Chinese) 

2£LL Zim T . Wu P . XuQ . Huang L , UKZ, Lu YP , Ma D 

Department of Obstetrics and Gynecology, Tonga i Hospital, Tongji Medical College, 
Huazhong University of Science and Teclinology, Wuhan 430030, China. 

OBJECTIVE: To investigate the expression of human telomerase reverse transcriptase 
(hTERT) mRNA and protein in cervix cancer, cervical intraepithelial neoplasia (CIN) 
and normal cervix. METHODS: Expression of hTERT mRNA and the other two subunits 
of telomerase, human telomerase RNA component (hTR), human telomerase-associated 
protein (hTPl) was determined by RT-PCR in 3 cervix cancer cell lines, 2 diploid cell 
lines, 38 cases of cervix cancer, 16 cases of CIN and 20 cases of normal cervix. 
Telomerase activity was also examined by.telomeric repeat amplification protocol 
enzyme-linked immunosorbent assay (TRAP-ELISA). Expression of hTERT protein was 
detected in all the cell lines and 101 cases of paraffinized cervix tissue sections. 
RESULTS: hTERT mRNA expression was detected in ail of the three cervix cancer cell 
hues, 81.6% of cervix cancer, 37.5% of CIN, 5-0% of normal cervix, while in neither of 
the two diploid cell lines. The other two subunits of telomerase were prevalently 
expressed in all of the cell lines and most cervix tissues. There was a strong correlation 
between hTERT oiRNA expression and telomerase activity. Dnmunostaining also 
revealed that hTERT protein was expressed in all three cervix cancer cell lines 65 5% of 
cervix cancer, 28.0% of CIN and 4.8% of normal cervix, CONCLUSION: Up -regulation 
of hTERT may play an important.role in the development of CIN and cervix cancer, 
hTERT could be used as an early diagnostic biomarker for cervix cancer. 

PMED: 16008894 [PubMed - in process] 
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ORIGINAL ARTICLE 

Gene expression signatures and biomarkers of noninvasive and invasive 
breast cancer cells: comprehensive profiles by representational difference 
analysis, microarrays and proteomics 

GM Nagaraja 1 , M Othman 2 , BP Fox 1 , R Alsaber 1 , CM Pellegrino 3 , Y Zeng 2 , R Kbanna 2 , 
P Tamburini 3 , A Swaroop 2 and RP KandpaP 

'Department of Biological Sciences, Fordham University, Bronx, NY, USA; department of Ophthalmology and 
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>Ve have characterized comprehensive transcript and 
proteomic profiles of cell lines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNXl The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCRU, cadherin 11, annexin 
Al, vimentin, lactate dehydrogenase B (upregulated in 
MDA-MB-231) and GREB1, S100A8, amyloid /( pre- 
cursor protein, claudin 3 and cadherin 1 (downregulated in 
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-specific 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell protiferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of pepti- 
dyl-prolyl isomerase, Raf kinase inhibitor and 80kDa 
protein kinase C substrate in mediating the inhibition of 
cell proliferation. 

Oncogene (2006) 25, 2328-2338. doi.10.1038/sj.onc.1209265; 
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Introduction 

The transformation of a normal cell into a cancer cell 
has been correlated to altered expression of a variety of 
genes CPerou et al, 2000; Becker et aL, 2005). The 
expression of some of these genes is a direct result of 
sequence mutation, whereas other changes occur due to 
alterations in gene products that participate in specific 
pathways. The changes in gene expression have been 
routinely characterized by classical subtraction hybridi- 
zation and differential display approaches (Cerosaletti 
et al, 1995; Alpan et aL, 1996). With the availability 
of the human genome sequence and sequences for a 
number of other model organisms, traditional methods 
have largely been replaced by gene microarrays 
(Khan etaL, 2001). These analyses have been used to 
characterize the molecular basis of a variety of diseases 
including cancer. A comprehensive analysis of a large 
number of cancer cell lines allowed clustering of genes 
into groups based on their expression patterns in 
phenotypically related cell lines (Khan et aL, 2001; 
Dan et aL, 2002; Rosenwald el aL, 2002; van't Veer 
et aL, 2002). The results of profiling experiments 
indicated expression of specific gene clusters in cell lines 
that have the same origin or have arisen from the same 
organ (Ross et ah, 2000). A complementary approach 
that has been used in limited ways is proteomics. 
Proteomics scores for changes in different proteins 
and peptides in cells with characteristic pheno- 
typic differences. However, a comparative analysis of 
transcripts and proteins to establish a relationship 
between transcript changes and protein levels has not 
yet become routine. , 

Although expression profiling of tumor tissue and its 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a tumor 
type, such comparisons have not been free of attendant . 
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complications. These complications arise due to hetero- 
geneity of tumor specimens wherein any cell type- 
specific changes are likely to be masked by other cell 
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to predict tumorigenic 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line^ a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section, 
RDA was performed by using cDNAs from MCF7 and 
MDA-MB-231 as tester/driver or driver/tester combina- 
tion. The difference product in the first case represents 
the genes that are either upregulated in or specific to 
MCF7. On the other hand, the difference product of 
MDA-MB-231 (tester) and MCF7 (driver) hybridiza- 
tion resulted in the isolation of cDNAs that are either 
upregulated in or specific to MDA-MB-231. The initial 
linkers used in this protocol had internal BgRl sites. 
One strand of the linker was used to amplify both the 
tester and driver cDNAs after linkers had been ligated 
to cDNAs. After removal of linkers from amplified 
cDNAs by digestion with BglTl, a dephosphorylated 
BgM adaptor was ligated to tester DNA. The BgUI 
adaptor had an internal EcoRl site. The difference 
product was digested with EcoRl and cloned in pBlue- 
Script vector. The cloning efficiency of the difference 
product was very low (5 x 10 4 c.f.u./^g of DNA). The 
low efficiency of cloning is attributed to a substantial 
fraction of amplified DNA product that is refractory to 
restriction digestion. The sequencing of a set of 100 
clones each from the difference libraries revealed 50 
different kinds of clones. The majority of these 
sequences were short fragments and represented either 
3' regions or internal fragments of transcripts. A 
summary of these clones is presented in Table 1. The 



involvement of the majority of these cDNAs is well 
characterized either in tumorigenesis or in metastasis. 
The phenotypic characteristics of MCF-7 and MDA- 
MB-231 ideally match with the biological significance of 
these genes. The alterations in transcripts for Rho 
signaling proteins, Ca 2+ binding/requiring proteins, 
tight junctions/anchoring junctions/gap junctions, cop- 
per binding or sensitive proteins, and RUNX2 are 
particularly noteworthy. ; 

The differential expression of a representative number 
of RDA clones was validated by semiquantitative PCR. 
As shown in Figure 1, these transcripts were either 
specific to or upregulated in the cell line that was used as 
a tester. Such analyses demonstrated that more than 
90%: of the clones were differentially expressed. The 
abundance of transcripts and the results of RT-PCR 
were also confirmed by Northern blotting (Figure 2). 
Hie pattern of hybridization clearly indicates that all 
these transcripts showed differential expression in 
MCF7 and MDA-MB-231 cells that were used as 
driver/tester combinations for the RDA. 



Gene mict oat rays 

After obtaining preliminary molecular signatures of 
MCF7 and MDA-MB-231 cells by RDA, we used 
Affymetrix gene micoarrays to expand the above 
analysis to identify a comprehensive set of transcripts 
that is deregulated in invasive breast carcinoma cells. 
The comparisons of cell lines on the basis of transcripts 
that are either present or absent as shown in Figure 3 
revealed that a set of 123 genes distinguishes JMDA-MB- 
23 1 cells from MCF7 and MCF10A. These genes can be 
classified by their involvement in functional classes such 
as transcription, signal transduction, cell adhesion, cell 
cycle, metabolism, transport, response genes and devel- 
opment (Figure 4). The majority of these genes 
participated in the process of signal transduction 
followed by transcription, cell adhesion and metabo- 
lism, respectively. A few transcripts in these dasses were 
tested by real-time RT-PCR to confirm their altered 
abundance. The selected transcripts showed changes 
ranging between two- and 10-fold, 11- and 20-fold and 
greater than 20-fold, and were in close agreement with 
the results of microarray analysis. The qualitative 
pattern of change observed in microarrays analysis 
was readily reproduced by real-time or semiquantitative 
RT-PCR for all transcripts tested. 

The number of altered transcripts was over 1000 
based on a change of twofold or greater, and a majority 
of these genes show changes varying between two- and 
fourfold (Figure 5). Interestingly, with all comparisons 
combined, there were 21 genes downregulated more 
than 50-fold and 55 genes that were upregulated more 
than 18-fold when specific cell line pairs were compared 
(Figure 5). The transcripts that represent the extremes of 
upregulated and downregulated scale can allow distinc- 
tion between MCF7 and MDA-MB-231 cells. These 
transcripts include GPCR11, cadherin 11, annexin 
Ai, vimentin, lactate dehydrogenase B (upregulated 
in 'MDA-MB-231) and GREB1, S100A8, amyloid 
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Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231 



Downregulated in MDA-MB-231 



Extracellular matrix/ mat rix-crosslinking proteins 
LOX 

Laminin pi 
Collagen VI-a 1 

Calcium-binding proteins 
Reticulocalbin I 
S100A8 
CuIIinS 

Transcription factors/promoter-binding proteins 
RUNX2 
c-Jun 
Fra-1 

Cell-cell adhesionj cell-surface receptor proteins: 

Cdhll 
* CYR61 

MHC class II antigen y chain 
Protease-activated receptor- 1 
Pro tease-activated receptor-4 



ATPase/GTPase and signal transduction proteins 
ATP7a 
Caveolin 2 

AXL receptor tyrosine kinase 
Rho GEF 3 
Rho/Rac GEF 18 
P21-Rac2 

Metalloproteases and MMP inhibitor proteins 
TIMP-2 
MT1-MMP 

Stress-response proteins 
Dual specificity phosphatase (DUSP) 

Cytoskeletal component and binding proteins 
Moesin 
Vimentin 
Filamin B 

Cell-cycle regulation and grornhf differentiation) apoptosis proteins 
Cyclin Bl 
Cyclin E 
Cyclin A2 
Bcl2-Iike 1 protein 



Secreted proteins and growth factors 
Milk fat globule protein 
TGFhx 

SMAD-specific E3 ub ligase 2 

Miscellaneous 
Prion protein 



Calcium-binding proteins 
Calgranuiin B 

Transcription factors! promoter-binding proteins: 
Chromosome 4 ORF 
Estrogen receptor 1 
RUNX2 variant (exon 8 deleted) 

* 

Cell-cell adhesion/cell-surface receptor proteins 
Claudin 3 

Amyloid p precursor protein 

Triose phosphate isomerase 

Plakoglobin 

Cdh 1 

Cdh3 

Annexin A9 

RAR-ot . 

Connexin 31 

ATPase/GTPase and signal transduction proteins 

RhoD 

RhoB 

TGF-0 Rl 
Stress-response proteins 

Protein kinase H 1 1 

Cytoskeletal component and binding proteins 
Keratin 18 
Tubulin S 1 

Microtubule-associated protein t 

Cell-cycle regulation and growth/dijferentiationfapoptosis proteins 
S100A13 
S100C 

Aurora kinase AIK2 
Nucleoside diphosphate kinase 

Secreted proteins and growth factors 

Trefoil factor 3 (TFF3) 

Trefoil factor I (TFF1) 

Four and a half LIM domainl 

Solute carrier family 16 SLCI6A6 
DNA replication 

DNA replication complex GINS-PSF2 
Miscellaneous 

Serine protease inhibitor type 1 (SPINT1) 

Human homolog of Xenopus protein XAG 

Hypothetical protein FU22222 

Hypothetical protein 20171 

Hypothetical protein MGC3265 



$ precursor protein, claudin 3 and cadherin 1 (down- 
regulated in MDA-MB-231). The distinction between 
MCF7 and MCplOA may be made based on keratin 19, 
serine protease, amyloid f$ precursor, neuropeptide Y ? 
receptor Yl (upregulated in MCF7) and caldesmon, 
annexin &A1, epithelial membrane protein 1, S100A2, 



keratin 15 (downregulated in MCF7). Likewise, 
MDA-MB-231 cells differ from MCF10A in vimentin, 
epithelial membrane protein 3, cadherin 11, GPCR 
116, collagen type XIII a 1, Bcl2-associated athanogene 
2 (upregulated in MDA-MB-231) and keratin 15, 
cystatin A, cadherin 1, CD24, calcium-activated chlor- 
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ide channel, SI OOP, GPCR 87 (downregulated in 
MDA-MB-231). Thus, a small subset of transcripts 
may serve as an accurate signature of these cell lines. 
Several of these gene products have been shown to 
participate in tumorigenesis and invasiveness of breast 
carcinoma cell lines. 

The invasiveness phenotype of MDA-MB-231 cells 
specifically relates to changes in the following functional 
classes: (a) cell adhesion molecules, (b) Ca 2+ requiring, 
Ca 2+ binding or Ca 2+ regulatory genes, (c) copper- 
sensitive or copper-transporting proteins and (d) specific 
regulatory proteins of Rho signaling. Among these 
functional groups, 23 transcripts involved in cell-cell 
or cell-matrix interactions are underexpressed in MDA- 



MB-231 cells and 21 transcripts were overexpressed in 
this cell line (Table 2). The comparison of Ca 2+ - 
requiring/binding genes indicated downregulation of 
26 transcripts and upregulation of 26 transcripts in 
invasive cells as compared to noninvasive cells (Table 3). 
While Ca 2+ homeostasis is extensively investigated in 
human cancers, copper homeostasis is an underexplored 
area. The alterations in copper homeostasis in breast 
carcinoma cells were reflected by changes in trans- 
cripts corresponding to a variety of copper-binding or 
copper-sensitive proteins/enzymes (Tables 1 and 4). The 
deregulation of Rho signaling was evident from changes 
in various proteins involved in this pathway (Table 4). 
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Figure 1 Semiquantitative evaluation of selected transcripts, (a) 
RNA was isolated from confluent culture dishes containing 
MCFJOA (lanes 1 and 4), MCF7 (lanes 2 and 5) ox MDA-MB- 
231 (lanes 3 and 6) cells. TTie amount of RNA was first determined 
spectrophometrically. Equal amounts of RNA, as determined by 
absorbance at 260 nm, were amplified with primers specific to actin 
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane 
containing size markers is labeled as M. (b) A set of primers 
corresponding to caveolin 2 (lanes 1 and 2), TGF-oc (lanes 3 and 4), 
Moesin (lanes 5 and 6), LOX (lanes 7 and 8), Axl receptor (lanes 9 
and 10), RhoD (lanes 1 1 and 12), S100A13 (lanes 13 and 14), TFF3 
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified 
for 32 cycles. Lanes I, 3, 5, 7, 9, 12, 14, 16 and 18 represent 
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, 11, 13, 15 
and 17 represent MDA-MB-231 cells. The lanes containing PCR 
products from MDA-MB-231 cells are marked with an asterisk. 
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Figure 2 Analysis of selected transcripts by Northern hybridiza- 
tion. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and 
DUSP (d) were labeled with a "P nucleotide and hybridized to 
blots containing RNA from MCF10A (lane 1), MCF7 (lane 2) 
and MDA-MB-231 (lane 3). The blots were washed stringently and 
developed as described. The amounts of RNA loaded were 
normalized as in Figure 1 . 
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
Affymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section. 
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Gene Ontology 
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Figure 4 FunctionaJ classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 
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Figure 5 Distribution of altered number of transcripts as a 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Table 2 Cell adhesion molecules altered in breast carcinoma cells 
Upregulated in MDA-MB-231 Downregulated in MDA-MB-231 



Cadherin 4 


Claudin3 


Cadherin 11 


Cadherin- 1 


Catenin 


Cadherin- 3 


Integrin a 6 


Cadherin- 18 


Transmembrane anchor protein 


Cadherin, LAG seven pass receptor 


Eph B2 


Down syndrome cell adhesion 


Dystonin 


Catenin-<52 


Laminin 01 


Eph A4 


Lamin 


Ephrin A4 


Filamin B 


Annexin A9 


Filamin C 


Ankyrin 3 


Tail in 1 


Sarcoglycan 


Butyrophilin 


Keratin 8 


Spectrin-ot 


MAP-7 


Spectrin-/? 


MAP-x 


Thrombospondin 


Plakoglobin 


Plastin 3 


Plakophilin 


Adducin 3 y 


Discoidin domain receptor 


Lamin B receptor 


Zona occludens 3 


Laminin p 2 


Periplakin 


Lamin A/C 


Protocadherin « 9 


Laminin y 2 




Laminin d 3 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typi- 
cally, > 300 protein spots could be visualized in silver- 
stained gels, and there were far fewer protein spots in 
gels that were stained with Coomassie blue. The 
comparison of MCF7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had ll unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF-10A. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein- tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalbin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1, an apoptosis/difFerentiation regula- 
ting protein galectin, Myc expression regulator far 
upstream binding protein- 1 and the microtubule reg- 
ulator protein Jstathmin. The translation initiation 
factors 5A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 

i 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-231 Downregulated in MDA-MB-231 



Reticulocalbin 
Dystonia 
Follistatin like 1 
Cullin 

Annexin A5 

EF hand domain containing-2 

Hippocalcin Uke-2 

LDLR 

Steroid sulfatase 

MT-actin crosslinking factor 

Inositol- 1 ,4,5-triphosphate 

receptor 3 

Sorcin 

Guanine nucleotide-binding 

protein-y 

GAS6 

SWP-70 protein 
Jagged 1 

EGF-containing fibrulin-like 
ECM 

Transglutaminase 2 

Thrombin receptor-like 1 

Plastin 3 

FYN oncogene 

PKC-a 

Caimegin 

Calpain 

HEG homolog 

Cyr61 



S100A8 
S100A7 
S100A13 

Tumor associated Ca 1 * 
signal transducer 
Notch homolog 3 
PKD2 

Adenylate cyclase 
Phospholipase-G 
Chemokine Jigand 12 
Ubquitin-specinc protease 
PKC->f 

Ret-protooncogene 

Signal peptide-CUB domain 

Mannosidase a 

Bradykinin receptor B2 

Solute carrier family 24 

Ca a * channel voltage-dependent /ft 

Regulator of G protein signaling 17 

Dystrobevin-a 

Synaptogamin 1 

Matrix gla protein 

EF hand domain family member Dl 

PR-cyclic AMP dependent 

ATPase-Ca ,+ transporting 

Calmodulin 1 

CaM kinase 



Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins 



Rho signaling proteins 



LOX 

LOX-1, LOX-2 
SCO cyt oxidase-deficient 
homolog 2 
COX 17 homolog 
Metallothionein IE, IF and 2a 
Ring finger protein7 
Arniloride-binding protein 1 
Neurotrypsin/motopsin 



Rho3 

Rho/Rac GEF 18 
Rho GEF 12 

Ras related C3 botulinum toxin 
substrate 2 

Cdc 42 effector protein 3 
Rho GEF 3 

Rho GDP dissociation inhibitor fi 
RhoD 



Changes in proliferation characteristics and protein 
profiles in response to transfection with antisense 
constructs of selected transcripts 

We had observed significant upregulation of transcripts 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase HI 1 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT- 



Table 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 



MDA-MB-231 cells 



Cell-cell adhesionj cell-surface 
receptor proteins 
Triose-phosphate isomerase 

Stress-response proteins 
Hsp27 

Superoxide dismutase 
Peroxiredoxin 2 



Cytoskeletal component/binding 
proteins 
Stathmin 

Celt- cycle regulation and growth/ 
differ entiationjapop to sis proteins 

Nucleoside diphosphate kinase A 

S100C 

Secreted proteins and growth factors 
. Macrophage migration inhibitory 
protein 

Miscellaneous 
Cyt c oxidase VIb 
.Peptidyl-prolyl cis-trans isomerase 
Ubiquitin 



Calcium-binding proteins 
Calcyclin 
Calreticulin 
Crocalbin 
Reticulocalbin 

Transcription Factors/ 
Promoter-binding proteins 

Far upstream element 

binding protein- 1 

Far upstream element 

binding protein-2 

Celt-cell adhesion i 'cell-surface 
. receptor proteins 
Galectin 

A TPaselGTPase) 'signal trans- 
duction/ trqfficing proteins 

Rho GDP dissociation 

inhibitor 1 

Stress-response proteins 
Hsp 70 

Peroxiredoxin 2 

Cytoskeletal component/ 
binding proteins 
Stathmin 

Miscellaneous 
Heterogeneous nuclear 
ribonudeo protein H 
eIF4H {translation) 
elF5A (translation) 



PCR. The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells trans- 
fected with empty vector. 

In order to relate decreased proliferation of transfec- 
tants to altered proteins, proteomic profiles of transfec- 
tants were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 
proteins included stress-response proteins, calcium- 
regulating proteins, translation factors, ubiquitin, pro- 
teins of electron transport chain and oxidative phos- 
phorylation, signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 5 and 15 
for various transfections. Peptidyl prolyl cis-trans 
isomerase, calcium-regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were promi- 
nent among altered proteins. We performed database 
searches to identify interactors for all proteins that were 
altered in transfected cells and identified nearly '350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. » 
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Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 



After transfection with as-trefoil factor 3(TFF3) 



After transfection with as-protein kinase Ell 



Calcium-binding proteins 

Calmodulin 
Cell-cell adhesion/ cell-surface receptor proteins 

Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/ trajficing proteins 

Raf kinase inhibitor 
Stress-response proteins 

Hsp27 

Peroxiredoxin 1 
Cytoskeletal component /binding proteins 

Cofilin-nonmusde isoform 
Cell-cycle regulation and growth/dijferentiation/apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

Translationally controlled tumor protein 
Miscellaneous 

40S ribosomaj protein S12 

ATP synthase D chain 

Cancer-associated Sm-like protein 

Cyt c oxidase polypeptide Va 

eIF5A (translation) 

Ferritin heavy chain 

His triad nucleotide-binding protein 

HistoneH2B.n 

Peptidyl-prolyl isomerase 

Proteosome subunit /J- type 6 

RNA-binding protein 8A 

Thioredoxin 

Ubiquitin crossreactive protein 

After transfection with as-aurora kinase AIK2 

Calcium-binding proteins 

Calgranulin B 

Calgranulin A 
Stress-response proteins 

Hsp27 

Superoxide dismutase 
Miscellaneous 

60S acidic ribosomal protein P2 
Peptidyl-prolyl isomerase 
Ubiquitin-crossreactive protein 



CeUr-ceU adhesion\cell-surface receptor proteins 
Galectin 

Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/ trqfficing proteins 

PKC substrate 
Stress-response proteins 

Hsp27 

Cytoskeletal component /binding proteins 

Actin-a, skeletal muscle 

Cytokeratin 18 
Miscellaneous 

30H-acyl CoA dehydrogenase II 

ATP synthase A chain 

Cyt c oxidase peptide Va 

Enhancer of rudimentary homolog 

Thioredoxin 

Ubiquinol-cyt C reductase 
No match 

After transfection with as-ONA replication complex GINS PSF2 
Calcium Binding proteins: 
CaJreticulin 

ATPase-GTPase/signal transduction/ trajficing proteins 

14-3-3 i; 

PKC substrate 
Cytoskeletal component/binding proteins: 

Cytokeratin 1 

Cytokeratin 18 

Cell-cycle regulation and & growth/ differentiation/ apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

MAP/MT affinity regulator 
Miscellaneous 

ATP synthase D chain 

Peptidyl-prolyl isomerase 

Ubiquinol-cyt c reductase 

After transfection with as-human homolog ofXAG 
Calcium-binding proteins 

CaJreticulin 
Stress-response proteins 

Hsp27 
Miscellaneous 

ATP synthase A chain . . 

Peptidyl-prolyl isomerase 

Ubiquitin crossreactive protein 

No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et a!., 1996; Jacob 
et al, 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyl oxidase and a 
copper transporter ATP7 A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et ah, 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin-dependent kinase inhibitor 
(p21 CIP1) (Westendorf et ah, 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of MDA-MB-231 (Fox and Kandpal, 
2004; unpublished observations). Lysyl oxidase, a 
copper-sensitive enzyme, causes oxidative deamination 
of lysine and hydroxy lysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 1976) that are 
found in invasive breast carcinoma cells (Akiri et ah, 

2003) . The activation of LOX is dependent on qopper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase et.ah, 

2004) , a protein mutated in Menkes disease (Molfer 

" et ah, 2005). Prion protein has also been repbrted as a. 



Molecular signatures of breast carcinoma cells 
GM Nagaraja et at 



2335 



chaperone for copper (Jones et al, 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TlMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousi- 
dou et al, 2004; Shao et al, 2005) and promoter 
methylation (Costa et a/.,. 2004). As several transcripts, 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in 1 particular. The changes in cell-cell interaction 
correlate to cell pheno types because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kaftdpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et al, 1 983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et al 7 2002). Prominent among calcium-binding proteins 
are S-100 protein, a group of intracellular messengers 
that respond to transient changes in calcium concentra- 
tion by binding to specific receptors (Marenholz et al, 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome lq21, 
a region of genome that is frequently altered in human 
breast cancer cells (Bieche et al, 1995). Calcium ions act 
as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et al, 2000) and are known 
to alter .caicineurin to activate transcription factors such 
as NFATc (Luo et al, 1996). 

As dictated by post- transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factors 3, human aurora2 
kinase AIK2, protein kinase HI 1 and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Yu 
et al, 2001; Fletcher et al, 2003; Katoh, 2003; Warner 
et al, 2003; Takayama et al, 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transfected cells. 
The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MDA-MB-231 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase 'inhibitor, histidine triad nucleo- 
tide-binding protein and peptidy-prolyl isomerase 
(Pinl). We believe histidine triad protein effects are 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et aL, 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pin 1 has been 
related to p53-mediated signaling pathways (Mantovani 
et al, 2004; Berger et al, 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005)* Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in . 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA-based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUNX2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a cell line established, from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB- 
231 that vary in their in vitro and in vivo invasiveness. All cells 
were cultured at 37°C/7% C0 2 . MCF-10A cells were grown in 
1:1 DMEM:F12 media (Gibco) with 5% horse serum (Gibco), 
20mM HEPES, lOng/ml EGF (Invitrogen), 10 ml/1 PehStrep- 
Glutamine (lOOOOU/ml penicillin, 10000/^g/ml streptomycin 
and 29.2mg/ml L-glutamrne), 10/ig/ml insulin (Invitrogen), 
0.1/ig/ml Cholera Toxin (Sigma) and 500ng/ml hydrocorti- 
sone (Sigma). MCF-7 and MDA-MB-231 cells were grown in 
DMEM (Gibco) supplemented with 2mM L-glutamine (Gib- 
co), 1 mM sodium pyruvate (Gibco), 5 ml/1 penstrep (5000 U/ 
ml penicillin and 5000^g/ml streptomycin), and 10% fetal 
bovine serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 100% ethanol, then dried and resuspended in 
DEPC-treated water. RNA was stored in aliquots at -70°C. 
The quality of RNA was visualized by running on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. 

DNAase treatment of total RNA 

To remove DNA contamination, 20 /xg of RNA (quantified 
specuophotometrically) was treated with fOOng DNAase I; 
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80 U RNasin (Promega) and 1 mM MgCl 2 in Tris buffer in a 
total volume of 50 ul. The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65°C for 30 min. 

Representational difference analysts (RDA) 
RDA of cDNAs is a modification of genomic RDA (Lisitsyn 
et aL, 1993). We performed RDA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been describer previously (Jacob 
et al., 1997). Briefly, first-strand synthesis was carried out using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a BgUl site was ligated to the tester as 
well as the driver cDNA. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
Bglil and the digested cDNA was gel purified. A second set of 
unphosphorylated BgUl adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized in a 
5 /tl reaction volume at a ratio of 1 :40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1/il of the reaction mixture was diluted to 
100 /d. The difference product was obtained by amplifying 1 ul 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoBl and cloned in a pBlueScript vector. 
Individual clones were picked up and sequenced by Sanger's 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitaive RT-PCR. 



Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (AfTymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22000 probe sets corresponding to 18400 
transcripts and variants, including 14500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in \2u\ RNase-free 
water. The cDNA was transcribed in vitro by using a RNA 
transcription labeling kit (Enzo Biochem, Famingdale, NY, 
USA) with 6 u\ of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-ll-CTP and bio-16-UTP. The 
biotinylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, by 
heating to 95°C in the presence of fragmentation buffejr, . 
between sizes of 35 and 200 bases. The GeneChips were ; 
hybridized overnight at 45 e C in hybridization oven in a. 
solution containing fragmented cRNA, control oligonucleo- ( 
tide B2, 20 x eucaryotic hybridization controls, herring sperm 
DNA, acetylated BSA and 2 x hybridization buffer. The 
GeneChips were washed and stained with streptavidin- 
ptiycoerythrin and the antibody in 2 x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6 um with 
a Affymetrix GeneChip scanner 3000. AfTymetrix MICRO- 
ARRAY SUITE 5 was used for initial data preparation 
(generation of .CHP files). Normalization (quantile method) 
and calculation *of signal intensities was performed with the 
software package RMA from the R project (http://www.r- 
project.org/).- For every cell line, three replicates were ? 
performed with Affymetrix Gene Chips. The (Gene Chip data 
were use$ for fur|her calculations after the raw 1 image and 



MAS5 analysis revealed a positive quality report. Ratios of 
average signal intensity (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was per- 
formed on probe sets as described (Yoshida et al., 2004). The 
statistical method used to assign f-values to the fold changes 
of gene responses is described by Yoshida et al. (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et aL, 2003). Functional annotation of 
proteins was assigned through Gene Ontology (http://www. 
geneontology.org) or Locuslink (http://www.ncbi.nlm-nih.gov/ 
LocusLink) classifications obtained through appropriate pub- 
lic databases. 

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript II (Invitrogen, 
Carlsabad, CA, USA) RT by priming with oligodT. The 
primers specific to validated genes were synthesized from the V 
untranslated region using Primer 3 software. PCR i tactions 
were then performed in triplicates in an I-cycler Thermocycler 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fold changes were calculated by differences in threshold cycles 
(Ct) between pairs of samples to be compared. HPRTgmz was 
used as a control. 

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product was used to normalize the RNA concentra- 
tion. The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysis 

The expression pattern of selected transcripts in cell lines was 
also analysed in Northern blots. RNA (20 fan), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65°C 
for 12-1 5 h with a radioactive probe and the blot was 
subsequently washed with 0.1 x SSC and 1% SDS at 65°C 
The hybridized probe was detected by autoradiography. 

Transfection of MCF-7 cells with antisense constructs 
The genes selected on the basis of their upregulation were 
cloned in antisense orientation in pCDNA3. 1 vector (Inviro- 
gen). MCF-7 cells were grown to 70-80% confluence. 
Approximately 4 fig of DNA was transfected into MCF-7 
cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400 fig/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcD^JA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered v 
bands were excised for mass spectrometry. 
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Protein isolation 

The cultured ceils were harvested by trypsinization and 
centrifuged at 220 £ for 5min at 4°C. The cell pellet was 
washed once with ice-cold 1 x PBS. The proteins were isolated 
by using a commercial kit (BioRad, Hercules, CA, USA). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors cocktail 
(Roche), vortexed and stored on ice for 30min. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at 100 g for 
lOmin at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000 £ 
for lOmin at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating" 2D/IEF 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis 

The protein mixtures were separated based on isoelectric 
points by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
sample (175 /ig) in 185 /d of sample buffer (8M urea, 2% 
CHAPS, 0.2% biolytes, 3/10 ampholytes, 65 mM DTT and 
0.002% brombphenol blue) was loaded in the sample loading 
trays at the end of 1 1 cm immobilized rehydxated strips (pH 3- 
10) (Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15min treatments with equilibration buffer (6 M urea, 0.375 M 
Tris-HQ pH 8.8, 2% SDS, 20% glycerol, 2% DTI) and 2.5% 
(W/V) iodoacetamide, respectively. Proteins were then re- 
solved in the second dimension on 8-16% gradient SDS- 
PAGE gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 



Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein lysates from MCF- 
10A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in lOOmM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 /d of 
Promega's autocatalysis-resistant trypsin (12.5ng//d in 50 mM 
ammonium bicarbonate and 5 mM CaCl 2 , pH 8.0) overnight at 
37°C. The supernatant (5 jd) from tryptic digest was injected 
for peptide sequence. anaiysis-Jising on-line capillary liquid 
chromatography-electro spray ionization-tandem mass spectro- 
metry (LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300 /mi inner diameter 
x 15mm C18 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1 100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 pl/min. Peptides were detected at 215 nM. The in- 
line mass spectrometer was a Thermo Electron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a nonredundant protein 
database using the Sequest program. 
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Identification of androgen-regulated genes in the 
prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen depriva- 
tion. To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP, at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacrylamide gel electrophoresis. Among 123371 transcripts anal- 
yzed, a total of 28844 distinct SAGE tags were identified representing 1 6 570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
(p < 0.05), of which 147 were induced and 204 repressed by androgen, in two indepen- 
dent experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots anaiyzed. The change in intensity for mast of trie affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study pro- 
vides a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 

Keywords: Androgen / Prostate / LNCaP / Gene expression / Serial analysis of gene expression 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AB) lead to ligand- 
independent activation or promiscuity of the receptor 
[1 , 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3, 4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 



28-30% of tumors that recurred post androgen-ablation 
therapy [6, 7]. Wallen ef a/. [8] have recently shown that 
one-third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator 
amplification and corepressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathways) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer cejl line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 



Correspondence: Dr. Y. Q. Chen, Department of Pathology, 
Wayne State University, 540 E. Canfield, Detroit, Ml 48201 , USA 
E-mail: yqchen@med.wayne.edu 
Fax:+1-313-5770057 

Abbreviations: DHT, dihydrotestosterone; G3PD, glyceralde- 
hyde-3-phosphate dehydrogenase; PSA, prostate specific anti- 
gen; RT-PCR, reverse transcrfption-pofym erase chain reaction; 
SAGE, serial analysis of gene expression 



2.1 Prostate cell line and DHT treatment 

LNCaP human prostate cancer cell line (American Type Cul- 
ture Collection, Rockvitle, MD, USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal- 
stripped fetal bovine serum (FBS) at 37°C. Half of the cul- 
tures were then treated with 1 0r 9 m DHT for 24 h. Total RNA 
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and protein were extracted from untreated and DHT treated 
cells using TRIzol (Invitrogen, Carlsbad, CA, USA) reagent 
for RNA and a solubilization cocktail for proteins, consisting 
of 9 m urea, p-mercaptoethanol and 1 % NP-40. For kinetics 
experiments 1 x 10 6 cells were plated in phenol-free RPMI 
medium with 5% charcoal-stripped FBS. On day three, 
cells were treated with 5 mL of media containing 1C 9 m 
DHT and total RNA was extracted at different time points 
from 0^96 h. To determi ne the direct/indirect effect of andro- 
gen on RNA levels, cells were plated as above and treated 
with 5 ng/mL of cyclohexamide in the presence or absence 
of 1 0" 9 m DHT for 24 h and total RNA was extracted. 

2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed 
as described previously [11] with the following modifi- 
cations: ditags^erePCR ampiified using biotinylate^Tpri- 
mers and digested with NlalW enzyme [12J. Concatemers 
were heated for 15 min at 65°C and chilled on ice for 
10 min before separation on an 8% polyacryiamide gel 
[13]. The concatemers were then cloned into the Sphl 
site of the pZero vector (Invitrogen). Concatenated tags 
were screened by PCR using M13 forward and M13 
reverse primers. PCR products with inserts greater than 
500 bp were isolated and sequenced with M13 forward 
primer on an automated 3700 DNA sequencer (Perkin- 
Elmer, Norwalk, CT, USA). For microSAGE, 1 jig of total 
RNA per tube was used for cDNA synthesis in two tubes, 
with the mRNA Capture Kit (Boehringer Mannheim, 
Indianapolis, IN, USA. cONAs were cleaved with Affaltl, 
ligated to linkers and then digested with Bsm Fl enzyme. 
The released tags were ligated, and processed for the 
rest of steps as with the standard SAGE protocol. 

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (httpy/www.ncbt.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis- 
carded. Only tags that match to a single gene were tabu- 
lated and composite counts analyzed for their signifi- 
cance; (2) inclusion method: tags that match to multiple 
genes were counted at 100% toward each gene. All tags 
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes fcxO.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have a p value <0.05 in both lists were considered statis- 



tically significant. The total number of genes identified 
was estimated by N m + (N^ - 0.1 where N m is 

the number of genes matched to SAGE tags, N^ is the 
number of SAGE tags that do not match to known genes 
or ESTs, with 10% representing the estimated sequencing 
error per SAGE tag and 3.5, the average number of tags 
per gene in the SAGE reliable map (release 10-26-2000). 

2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PCR (RT-PCR), 1 \ig of total RNA 
was reverse transcribed into cDNA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVision 760 Image Workstation (Nucleotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RT-PCR was done at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the iCycler (Bio- 
Rad, Hercules, CA, USA). Briefly, one fortieth of cDNA was 
used in each reaction. Six reactions were carried out for 
each gene and three independent experiments were per- 
formed. PCR mix comprised of 1X PCR buffer, 1.5 mm 
MgCfe, 0.1 mwi dNTP, 200 nM primers (listed below), 0.05 U 
platinum Taq polymerase (Invitrogen) and 0.1 x SYBR green 
(Molecular Probes, Eugene, OR, USA). PCR was carried out 
at 94*0 for 2 min, and 40 cycles of 94°C for 30 s, 55°C for 
30 s, and 72°C for 30 s. Primers used for real-time PCR 
quantification: PSA (prostate specific antigen) (Hs.i 71 995, 
5 '-GG AAATGACC AGGCC AAG AC-3 \ 5 ' -C AACCCTG GAC 
C TCACACCTA-3-), SCMH1 (Hs.57475, 5*-GCCTTGACC 
ACATC^CTCCAT-S', 5'-AGGCCTAGGGCTGCAAAAG-3'), 
and clusterin (Hs.75106, 5 '-GCAGG AATACCGCAAAAA 
GC-3*, 5'-GACTCAAGATGCCCCCGTAAG-3'). Standard 
samples (50, 25, 1 0, 5, 2.5, 1 and 0.5 jiLcDNA) and experi- 
mental samples were used in real-time quantification PCR. 
Each sample was run in quadruple reactions. Standard 
curve (Ct = mX + B) was obtained, where Ct is threshold 
cycle number, X is log quantity of target molecules, m is 
curve slope and B is Y-axis intercept value. Number of 
fold induction or repression for a given target molecule 
was calculated by Qa/Qb, where Qa is the target quantity 
in experimental sample A and Qb is the target quantity 
in experimental sample B. Cta = mlogQa + B or Qa = 
10 (cta-Byn\ therefore, Qa/Qb = 1 0^^^. 

2.5 2-D PAGE 

The procedure followed was as previously described 
[14], Cells were solubilized in 200 \il of lysis buffer con- 
taining 9.5 m urea j(Bio-Rad), 2% NP-40, 2% carrier 
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ampholytes pH 4-8 (Gallard/Schlessinger, Carle Place, 
NY, USA), 2% p-mercaptoethanol and 10 mM PMSF. 
Aliquots containing approximately 5x10 6 cells, were 
applied onto isofocusing gels. IEF was conducted using 
pH 4 to 8 carrier ampholytes at 700 V for 16 h, followed 
by 1000 V for an additional 2 h. The 1-D gel was loaded 
onto the 2-D gel, after equilibration in 125 mM Tris, pH 6.8, 
10% glycerol, 2% SDS, 1% DTTand bromophenolblue. 
For the second dimension separation, a gradient of 
11-14% of acrylamide (Serva, Hauppauge, NY, USA) 
was used. Following 2-D PAGE, proteins were visualized 
by silver staining of the gels or transferred to an Immobi- 
lon-P PVDF membrane (Miflipore, Bedford, MA). 



2.6 Protein identification by mass spectrometry 

The 2-D gels were silverstained by successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1% silver 
nitrate for 40 min and 0.014% formaldehyde plus 2% 
sodium carbonate. The proteins of interest were excised 
from the 2-D gels and destained for 5 min in 15 mM 
potassium ferricyanide and 50 mM sodium thiosulfate 
as described [15]. Following three washes with water, 
the gel pieces were dehydrated in 100% acetonitrile for 
5 min and dried for 30 min in a vacuum centrifuge. 
Digestion was performed by addition of 100 ng of trypsin 
(Promega, Madison, Wl, USA) in 200 mM ammonium 
bicarbonate or by addition of 100 ng of the endopro- 
teinase Glu-C (Promega) in 100 mM ammonium bicarbo- 
nate. Ttw Lys-C digestion was performed with 500 ng of 
the endoproteinase Lys-C (Roche, Mannheim, Germany) 
in 100 mM Tris-HCI, pH 9. Following enzymatic digestion 
overnight at 37°C f the peptides were extracted twice 
with 50 nL of 60% acetonitrile/1 % TFA. After removal of 
acetonitrile by centrifugation in a vacuum centrifuge, the 
peptides were concentrated by using pipette tips C18 
(Millipore). 

Analyses were performed primarily using a PerSeptive 
Biosystem MALDI-TOF Voyager-DE mass spectrometer 
(Framingham, MA, USA), operated in delayed extraction 
mode. Peptide mixtures were analyzed using a saturated 
solution of cyano-4-hydroxycinnamic acid (CHA) (Sigma, 
St. Louis, MO, USA) in acetone containing 1% TPA 
(Sigma). Peptides were selected in the mass range of 
800-4000 Da. Spectra were calibrated using calibration 
mixture 2 of the Sequazyrne peptide mass standards kit 
(PerSeptive Biosystems). The search program MS-Fit, 
developed by the University of California at San Francisco 
(http://pnospector.ucsf.edu), was used for searches in the 
NCBI database. Search parameters were as follows: 
maximum allowed peptide mass error of 400 ppm, con- 
sideration of one incomplete cleavage per peptide and 



pH range between 4 and 8. MALDI-TOF mass spectro- 
metry was also used for molecular weight determina- 
tion as described [16]. In some cases, the amino acid 
sequence of some peptides of interest was determined 
by ESI MS analysis. 

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cul- 
tured in the presence or absence of DHT. A total of 
123371 tags were generated of which 62 878 were from 
the LNCaP cell line and 60493 from the LNCaP cells 
treated with DHT for 24 h (Table 1). Sequence analysis 
identified a total of 28844 distinct tags representing 
16570 genes, 1 1 243 from LNCaP and 12 203 from DHT 
treated cells. A total of 351 transcripts were differentially 
expressed at a significant level (p<0.05). Eighty-seven 
percent of transcripts matched GenBank entries; 79% 
corresponded to known sequences and 8% to ESTs. 
RNA levels for 1 47 genes were increased and 204 genes 
were decreased after DHT treatment (Table 2). Therefore, 
at the RNA level more genes were repressed than 
activated by androgens. Of these androgen-regulated 
genes, 149 were changed £ five-fold by DHT treatment 
(Table 3). 



Table 1. Summary of SAGE analysis in LNCaP cells 



Sample 


Total 

transcripts 


Distinct 
tags 


Number 
of genes 


-DHT 


62 878 


17 050 


11 243 


+ DHT 


60 493 


18 510 


12 203 


Total 


123 371 


28 844 


16 570 



- DHT: LNCaP cells without dihydrotestosterone; +> DHT: 
LNCaP cells treated with 10-® m dihydrotestosterone for 
24h 



Table 2. DHT regulated genes in LNCaP cells 

Genes LNCaP vs LNCaP + DHT 

(gene#) 



p 0.05 p 0.01 



Known genes 


277 


147 


ESTs 


29 


8 


No match 


45 


13 


Total 


351 


168 


Up-regulated by DHT (24 h) 


147 


65 


Down-regulated by DHT (24 h) 


204 


103 
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UGDorTag CAP DHT Change P value Description 



283305 


1 


82 


Induced 


0 


immunoglobulin heavy contant alpha 1 


183 752 


0 


42 


Induced 


0 


microseminoprotein, beta- 


140 


1 


44 


Induced 


0 


immunoglobulin heavy constant gamma 3 (Gm marker) 


75415 


18 


95 


Induced 


0 


beta-2-microglobulin 


84298 


0 


17 


Induced 


1.00E-05 


CD74 antigen (invariant polypeptide of major histocompability 
complex, class II antigen-associated) 


9615 


o 


16 


Induced 


1.00E-05 


myosin regulatory light chain 2 ( smooth muscle isoform 


77443 


0 


16 


. Induced 


1 .00E-O5 


actin, gamma 2, smooth muscle, enteric 


1119 


0 


15 


Induced 


2.33 E-05 


nuclear receptor subfamily 4, group A, member 1 


84753 


23 


2 


Repressed 


3.00E-05 


KIAA0246 protein 


75777 


0 


14 


Induced 


3.66E-05 


transgelin 




0 


14 


Induced 


3.66E-05 


immunoglobulin J polypeptide, linker protein for immuno- 


76325 










giODUiin aipna ana mu poiypeuaes 


1852 


1 


17 


Induced 


4.33 E-05 


acid phosphatase, prostate 


78465 


0 


' 13 


Induced 


9.00 E-05 


v-jun avian sarcoma -vims-r; encogehetiomciog 


78344 


1 


16 


Induced 


1 .20E-04 


myosin, heavy polypeptide 1 1 , smooth muscle 


83006 


19 


2 


Repressed 


1 .50E-04 


CGM39 protein 


263812 


15 


1 


Repressed 


3.19E-04 


. nuclear distribution gene C (A. nidulans) homolog • ■ 


TACGGGGATA 


0 


11 


Induced 


4.20E-04 


Novel 


GCCTGGGTGG 


11 


0 


Repressed 


6.63E-04 


Novel 


GACTGACACT 


16 


2 


Repressed 


8.40E-04 


Novel 


284296 


0 


10 


Induced 


8.60E-04 


Homo sapiens SURF-4 mRNA, complete cds 


75105 


13 


1 


Repressed 


1.19E-03 


emopamil-binding protein (sterol isomerase) 


128075 


10 


0 


Repressed 


1.30E-03 


ESTs 


154162 


15 


2 


Repressed 


1.43E-03 


ADP-ribosytation factor-like 2 


6895 


1 


12 


Induced 


1.49E-03 


actin related protein 2/3 complex, subunit 3 (21 kD) 


143240 


17 


3 


Repressed 


1.63E-03 


ESTs 


211582 


0 


9 


Induced 


1.66E-03 


myosia Wght polypeptide kinase 


180266 


0 


9 


Induced 


1.66E-03 


tropomyosin 2 (beta) 


93002 


14 


2 


Repressed 


2.53E-03 


ubiquitin carrier protein E2-C 


17883 


14 


2 


Repressed 


2.53E-03 


protein phosphatase 1G (formerly 2C), magnesium- 










dependent, gamma isoform 


126023 


14 


2 


Repressed 


2.53E-03 


ESTs, Highly similar to NTC1JHUMAN NEUROGENIC LOCUS 










NOTCH PR0 1 1 IN HUMULUb 1 rnfcL/lmoUrl 


69469 


2 


13 


Induced 


2.92 E-03 


dendritic cell protein 


77899 


2 


13 


Induced 


2.92E-03 


tropomyosin 1 (alpha) 


285501 


2 


13 


Induced 


3.07E-03 


Human rearranged immunoglobulin lambda light chain mRNA 


119209 


0 


8 


Induced 


3.07E-03 


Insulin-like growth factor binding protein 7 


173043 


0 


8 


Induced 


3.07E-03 


metastasis-associated 1 -like 1 


75866 


0 


8 


Induced 


3.07E-03 


dimethylarginine dimethylaminohydrolase 1 


171695 


0 


8 


Induced 


3.07E-03 


dual specificity phosphatase 1 


TACGGGGATT 


0 


8 


Induced 


3.34E-03 


Novel 


172791 


13 


2 


Repressed 


4.19E-0i 


ubiquitously-expressed transcript 


13561 


13 


2 


Repressed 


4.19E-03 


jESTs, Weakly similar to dJ37E16.5 [H. sapiens} 


98260 


8 


0 


Repressed 


4.34E-03 1 


, ESTs 


CATAAGACTT 


8 


0 


Repressed 


4.64E-03 


' Novel 


TACGGGGACA 


2 


12 


Induced 


5.76E-03 


Novel 


180034 


12 


2 


Repressed 


6.91 E-03 


cleavage stimulation factor, 3* pre-RNA, subunit 3, 77kD 


5753 


12 


2 


Repressed 


6.91 E-03 


inosito(myo)-1 (4)-monophosphatase 2 


278941 


10 


1 


Repressed 


6.94E-03 


PRO0628 protein 


19500 


0 


7 


Induced 


7.25E-03 


nuclear localization signal deleted in velocardiofacal syndrome 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


nnne 

yuub 


u 


7 


II IUUUC/U 


7.25E-03 


VAMP (vesicle-associated membrane protein)-associated 










protein A (33kD) 




o 


7 


Induced 


7.25E-03 


Homo sapiens clone HQ0692 


AACTGCTGGC 


12 


2 


Repressed 


7.63E-03 


Novel 


GCTGACCGTC 


7 


0 


Repressed 


8.59E-03 


Novel 




7 


o 


Repressed 


8.59E-03 


Novel 


243901 


o 
c. 


1 1 


Induced 


8.82E-03 


Homo sapiens cDNA FU20738 fis, clone HEP08257 


1 1 y UUU 


1 

1 


Q 


Inrli ifoci 

II IUU(<CU 


9.12E-03 


actinin, alpha 1 


7ft RQR 


-j 


g 


Induced 


9.12E-03 


proteasome (prosome, macropain) subunrt, beta type, 5 




7 
f 


o 


nvpi wwwwu 


9.61 E-03 


trophinin associated protein (tastin) 


180545 


f 




nepresocu 


Q R1P.Q3 

tJtV 1 1— *J>J 


Homo sapiens mRNAfor hypothetical protein (TR2/D1 5 gene) 


77719 


7 


0 


Repressed 


q C1 cno 


' /lommaJiltttamul fnrhnYvlfl^fi * 
yaifUHct-yiuuiiiiyi v^cu uuAyiaoo 


4877 


7 


0 


Repressed 


q C1 c no 
y,u I t-UJ 




22795 


7 


0 


Repressed 


y .d i c-u j 


CO I o 


79335 — 


7 


0 


Repressed 




Homo sapiens SWI/SNF-relaterijnatrix-associated, actin- 








dependent regulator of chromatin D1 (SMARCD1) mRNA 


1 19 177 


y 


1 
1 


ncjJi coocu 


1 .26E-02 


ADP-ribosylatin factor 3 ■■ ' * 


CGGGAGCACC 


9 


1 . 


Repressed 


1 9QP.n9 




182217 


0 


6 


Induced 




ct irr*inatf»-nnA !ina<%a ADP-formina . beta sub unit 


75106 


0 


6 


Induced 




Wi letorin frnmnipmpnt Ivsis inhibitor SP-40. 40. sulfated qly- 






coprotein 2, testosterone-repressed prostate message 2) 


256311 


0 


D 


tnrli tr-ori 

inouceLi 


1 .32E-02 


granin-like neuroendocrine peptide precursor 


103180 


0 


b 


. IMQUUcU 


132E-02 


DC2 protein 


TACGGGGATG 


1 


Q " 

o 


inuuceu 


1 70E-02 

I.I \Jl^ \J£- 


Novel 


AAACAAATCA 


2 


10 


Induced 




Novel 


136644 


2 


10 


Induced 


1 .71 C-02 


Oo DOX"Coniaining vvu ptuitsin 


8036 


2 


10 


Induced 


4 TICJIO 

1 1 t-u^ . 


nlinhlactnma nVArfiVnrP^ftd 
□IIUDIaolUi lla UVWOAp cooou 


74284 


2 . 


10 


Induced 


■4 7*r no 

1.7ib-U<c 


COTc VA/aaW-K/ cirnilartn Similar to S CGr&VlSi39 hVDOthetical 










protem L31 1 1 


ATGGCTGATC 


6 


0 


piepresseo 


1 75F-02 


Novel 


ATCACTGCCC 


6 


0 


Repressed 


1 ,/OC-Ut 


Novel 


ACATCATCAG 


6 


0 


Repressed 




Novel 


CCAGTCCAAG 


6 


0 


Repressed 






54842 


1 


8 


Induced 


h 7oc no 


to is 


1526 


1 


8 


Induced 




ATPa<ip Ha 4-+ transooitina cardiac muscle, slow twitch 2 


7911 


1 


8 


Induced 


1 7RP-nO 


KIAAQ323 nrotein 


227400 


1 


8 


Induced 


•* 70C no 
1 .7oc.-\)d 


mitnnan or^tiv/otoH nrntoln kin3<5fi kinase kinase kinase 3 


77269 


1 


8 


Induced 


■4 7/jr no 


/ii lanino rn iHontiHn KinHinn nrntfiin fG orotsinl. aloha inhibitino 
guanins nucleoli uo umuiiiy jjiuiohi^v^ "■f-" >u 










or^tiuitv nnlv/npntiHp ° 


7943 


6 


0 


Repressed 


1 .81 E-02 


RPB5-mediating protein 


278544 


6 


0 


Repressed 


•i' c no 

1 .81 e-02 


n/sah<l^nan < n/ma A or > ot\/ltranQfpr3<>A ? fflCetoacetvl 
3CGTyi~V^06nZ.yrntJ M auciyiU eu loici aoo t. laooiuauKij i 








Pnan7\/mo A thinlsicpt 
OOof liyn Ic r\ ii iiuicioc/ 


23111 


6 


0 


Repressed 


1.81 E-02 


phenylalanine-tRNA synthetase-like 


31 442 


6 


0 


Repressed 


1.81 E-02 


RecQ protein-like 4 


38628 


6 


0 


Repressed 


1.81 E-02 


hypothetical protein 


77422 


6 


0 


Repressed ' 


1.81 E-02 


proteolipid protein 2 (colonic epithelium-enriched) 


211973 


6 


0 


Repressed 


1.81 E-02 


homolog of Yeast RRP4 (ribosomal RNA processing 4), 








S'-S'-exoribonuclease 


171075 


6 


0 


Repressed 


1.81 E-02 


replication factor C (activator 1) 5 (36.5kD) 


89781 


6 


0 


Repressed 


1.81 E-02 


upstream binding transcription factor, RNA polymerase I 


154149 


6 


0 


Repressed 


1.81 E-02 


apurinic/apyrimidinic endonuclease(APEX nuclease)-iike 2 
protein 



t 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


279772 


8 


1 


Repressed 


2.28E-02 


brain specific protein 


205091 


8 


1 


Repressed 


2.28E-02 


ESTs, Weakly similar to WW domain binding protein 1 1 










(M. muscu/us) 


75658 


8 


1 


Repressed 


2.28E-02 


phosphorylase, glycogen; brain 


GGGCAGCTGT 


8 


1 


Repressed 


2.37E-02 


Novel 


105440 


0 


5 


Induced 


2.43E-02 


hepatocyte nuclear factor 3, alpha 


118244 


0 


5 


Induced 


2.43E-02 


protein phosphatase 2, regulatory subunit B (B56), delta iso- 
form 


82389 


0 


5 


Induced 


2.43E-02 


CGI-1 18 protein 


31638 


0 


5 


Induced 


2.43E-02 


restin (Reed-Steinberg cell-expressed intermediate filament- 












associated protein) 


12013 


0 


5 


Induced 


2.43E-02 


ATP-binding cassette, sub-family E (OABP), member 1 


12797 


0 


5 


Induced 


2.43E-02 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 


7736 


0 


5 


Induced 


2.43E-02 


hypothetical protein 


274479 


0 


5 


Induced 


2.43E-02 


NME7 


153138 


0 


5 


Induced 


243E-02 


origin recognition complex, subunit 5 (yeast homolog)-like 


16034 


0 


5 


Induced 


2.43E-02 


ESTs 


270072 


0 


5 


Induced 


2.43E-02 


ESTs 


187035 


0 


5 


Induced 


2.43E-02 


ESTs 


140452 


0 


5 


Induced 


2.43E-02 


cargo selection protein (mannose 6 phosphate receptor bin- 










ding protein) 


182265 


0 


5 


Induced 


2.43E-02 


keratin 19 


19762 


0 


5 


Induced 


2.43E-02 


ESTs, Weakly similar to unknown (D. melanogaster) 


317 


0 


5 


Induced 


2.43E-02 


topoisomerase (DNA) 


6236 


0 


5 


Induced 


2.43E-02 


ESTs 


GCTGGAGCCT 


5 


0 


Repressed 


3.03E-02 


Novel 


CCAGTGCTCA 


5 


0 


Repressed 


3.03E-02 


Novel 


ACCCTACATA 


5 


0 


Repressed 


3.03E-02 


Novel 


GGGGAAATCT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACTGGTACTG 


5 


o 


Repressed 


3.03 E-02 


Novel 


GCTCCGGTGT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACAGTGGTGA 


5 


0 


Repressed 


3.03E-02 


Novel 


7869 


1 


7 


Induced 


3.04E-02 


lysophosphatidic acid acyltransferase-detta 


12101 


1 


7 


Induced 


3.04E-02 


hypothetical protein 


266940 


1 


7 


Induced 


3.04E-02 


t-complex-associated-testis-expressed 1 -like 1 


6196 


1 


7 


Induced 


3.04E-02 


integrin-linked kinase 


366 


1 


7 


Induced 


3.04E-02 


6-pyruvoyltetrahydropterin synthase 


173611 


1 


7 


Induced 


3.04E-02 


NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49kD) 










(NADH-ceonzyme Q reductase) 


102469 


1 


7 


Induced 


3.04E-02 


putative nuclear protein 


78825 


1 


7 


Induced 


3.04E-02 


matrin 3 


284465 


1 


7 


Induced 


3.04E-02 


ESTs 


30738 


1 


7 


Induced 


3.04E-02 


hypothetical protein FU1 0407 


78687 




7 


Induced 


3.04E-02 


neutral sphingomyelinase (N-SMase) activation assoviated 










factor 


92381 


1 


7 


Induced 


3.04E-02 


nudix (nucleoside diphosphate linked moiety X)-type motif 4 


242039 


5 


0 


Repressed 


3.26E-02 


EST 


4766 


5 


0 


Repressed 


3.26E-02 


DKFZP586O0120 protein 


283109 


5 


0 


Repressed 


3.26E-02 


hypothetical protein DKFZp762L1 710 


192853 


5 


0 


Repressed 


3.26E-02 


ubiquitin-conjugating entyme E2G 2 (homologous to yeast 










UBC7) 


153678 


5 


0 


Repressed 


3.26E-02 


reproduction 8 



i 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


251317 


5 


0 


Repressed 


3.26E-02 


EST 


279623 


5 


0 


Repressed 


3.26E-02 


selenoprotein X 


150319 


5 


0 


Repressed 


3.26E-02 


ESTs 


102456 


5r«.. . 


0 


Repressed 


3.26E-02 


survival of motor protein interacting protein 1 


284250 


5 


0 


Repressed 


3.26E-02 


AD-003 protein 


251871 


5 


0 


Repressed 


3.26E-02 


CTP synthase 


270480 


7 


1 


Repressed 


3.92E-02 


ESTs, Weakly similar to ALU5_H U MAN ALU SUBFAMILY SC 










SEQUENCE 


26655 


7 


1 


Repressed 


3.92E-02 


glucose-6-phosphatase, transport (glucose-6-phosphate) 
protein 1 


8118 


7 


1 


Repressed 


3.92E-02 


leukotriene A4 hydrolase 


CTCCGCCGGC 


7 


1 


Repressed 


4.47E-02 


Novel 


AGGAAATGCT 


7 


1 


Repressed 


4.47E-02 


Novel 


GCTGACCGAGG 7 


1 


Repressed 


4.47E-02 


Novel 


CCCATAGTCC 


7 


1 


Repressed 


4.47E-02 


Novel 



CAP: LNCaP cells without dihydrotestosterone 

DHT: LNCaP cells treated with 1 0r 9 dihydrotestosterone for 24 h 

UGD: unigene ID 

Numbers in CAP and DHT columns are the number of tags observed in respective samples 



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA 
level, semiquantitative RT-PCR assay was performed on a 
group of selected genes. One ng of total RNA was reverse 
transcribed into cDNA with oligo dT. cDNA was amplified by 
PCR for various cycles. PCR products were separated on 
agarose gels and quantified by densitometry. PCR ampli- 
fication curves were plotted for each gene, and data within 
the logarithmic phase of amplification were used for quanti- 
fication (Rg. 1 A, B and C). The reproducibility of gene regu- 
lation by androgen was confirmed in three independent 
experiments by monitoring the induction of PSA in the pres- 
ence or absence of cyclohexamide (Rg. 1 B). The kinetics 
of gene induction or repression was determined for PSA 
(Hs.171995), clusterin (Hs.75106) and SCMH1 (Hs.57475) 
genes. Typical examples of kinetics are shown in Rgure 
1C. PSA was induced at 4-6 h, peaked between 6-20 h, 
and gradually declined after 20 h post-treatment of DHT. 
Clusterin was induced within 0.5-1 h and gradually declined 
after 6-1 2 h. SCMH1 was repressed 2-4 h post-treatment. 
Expression of PSA, clusterin, and SCMH1 was quantified by 
real-time PCR (Rg. 1D). 



3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17], Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells; 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 1 2 protein spots decreased in intensity by at least two- 
fold in these two independent experiments. A two-fold 
change in integrated intensity by silver staining represents 
on average a three-fold change in amount of protein, 
based on prior quantitative studies [1 7}. 

The 44 protein spots that changed in intensity as a result 
of DHT treatment were excised from the gels, digested 
with trypsin, and subsequently analyzed by MALDI-TOF 
MS. The resulting spectra were used to identify the pro- 
teins using the MS-Rt search program. Of the 44 spots 
excised from the gels, 29 were identified without ambi- 
guity and consisted of 21 up-regulated and eight down- 
regulated proteins (Table 4). The identified proteins repres- 
ented a heterogeneous group that included cytoskeletal 
proteins (e.g. tropomyosin, a tubulin), metabolic enzymes 
(e.g. adenine phosphoribosyt transferase, p 1 ,4 galactosyl 
transferase, galactocerebrosidase), and the products of 
previously described androgen responsive genes (e.g. 
SRY, selenium binding protein) [18, 19]. Specific anti- 
bodies confirmed the identification based on MS for all 
proteins analyzed by Western blotting, which included a 
tubulin, myosin light chain isoforms, nucleoside diphos- 
phate kinase A, glyceraldehyde 3-phosphate dehydro- 
genase (G3PD), and tropomyosin (data not shown). : 
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Figure 1. Kinetics and quanti- 
fication of androgen-regulated 
genes. Total RNA from un- 
treated and DHT treated 
LNCaP cells was reverse 
transcribed. cDNA was PCR 
amplified for various cycles. 
PCR amplification curves were 
plotted for each gene. A cycle 
number within the logarithmic 
phase of amplification was 
selected for semi-quantitative 
analysis. cDNA was also used 
for real-time quantitative PCR. 
(A) An example of RT-PCR 
amplification of PSA at cycles 
between 20-30. (B) RT-PCR 
of PSA in three independently 
treated LNCaP samples, h: 
hours; 0: no DHT; 24: treated 
with 10"® M DHT for 24 h; -: 
without cycloheximide; +: with 5 |ig/mL of cycloheximide for 24 h. (C) An example of three independent kinetics experi- 
ments for PSA, clusterin and SCMH1. (D) Comparison of SAGE results with that of real-time quantitative PCR for PSA, 
clusterin and SCMH1 . T: induction; j: repression. 





Corresponding SAGE data were available for most of the 
proteins affected by DHT treatment that were identified. It 
was therefore of interest to determine if the changes 
observed at the protein levels were matched by concor- 
dant changes at the mRNA level Remarkably, for most of 
the proteins identified, there was no appreciable concor- 
dant change at the RNA level (Table 4). 



4 Discussion 

We have compared the gene expression profile of 
LNCaP, an androgen responsive prostate cancer cell 
line, with or without androgen (DHT) treatment. Approxi- 
mately 350 of 16570 expressed genes detected at the 
RNA level were affected by dihydrotestosterone at the 
p<0.05 level. The DHT responsive genes included 
known genes, ESTs, and novel genes. As expected, we 
saw an induction of genes that are well known to be 
regulated by androgens. For instance, we found a 1.7- 
fold induction in kallikrein 3/PSA, a 7.6-fold induction in 
prostatic kallikrein 2, and a 15.7-fold induction in pro- 
static acid phosphatase by DHT in LNCaP cells. We 
also saw a five-fold induction in NKX3.1/NKX3A and 
three-fold in fatty acid synthase; two previously identified 
androgen-regulated genes (20, 21]. Interestingly p-micro- 
seminoprotein, reported to be reduced or lost in prostate 
tumor [22], was up-regulated more than 40-fold in 



LNCaP cells by DHT. More significantly, our data indicate 
that genes involved in a variety of tumor cell functions 
such as growth, apoptosis, and metastasis, are directly 
or indirectly regulated by androgens. In addition, it is 
noteworthy that both 0-actin and G3PD, the two most 
frequently used loading controls, were up-regulated 
approximately two-fold by DHT (p<0.01). Thus, these 
two genes may not be appropriate controls in some 
experiments. 

Serial analysis of androgen-regulated gene expression 
provides us with a list of candidate genes. However, 
many factors such as the dose' of DHT and the time of 
cell exposure to DHT and other unknown experimental 
variations will affect the level of gene expression. There- 
fore, it is important to confirm the SAGE results with alter- 
native methods. We performed semiquantitative RT-PCR 
(Fig. 1 B) on approximately 20 genes and real-time quanti- 
tative PCR on 10 genes in at least three independently 
DHT treated LNCaP samples. We noticed experimental 
variations in every gene determined. Some genes such 
as PSA (Fig. 1 B) have smaller variations and some have 
larger variations. In addition, the fold of alteration in 
expression identified by SAGE is different for some genes 
compared to that identified by real-time quantitative PCR. 
This is probably due to the technical limitation of SAGE. 
Serial analysis of gene expression is highly quantitative 
for genes whose tags are detected at large numbers. 



j 
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Figured 2-D PAGE patterns of 
untreated (A) and LNCaP cells 
treated with DHT for 72 h (B). 
Identified proteins that were 
reduced in levels following DHT 
treatments are shown in (A) and 
induced proteins are shown in 
(B). Numbered spots represent 
either proteins that were chan- 
ged in level that have not been 
identified or isoforms of Identi- 
fied proteins. 
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Protein name or Unigene Protein Change . mRNA Change 

sport number ID ~ ~ in protein Qh 24h inmRN/ 



• 

Adenine phosphoribosyl transferase 


OQ Q1 A 


n Adft 

U.HHO 


1 0ft4 t 


36 


18 I 


Alpha Endosumne 


111 OOU 


U.UDD 


A C7Q t 
U.DrS? | 


o 


5 f 


Alpha Tubluin 


070 0>10 


n QOA 


1 QQ9 t 


AQ 


38 I 


Nucleophosmin, B23 


173 205 


a occ 


A COO t 


to 


o X 


CGI-83 


HQ CCA 

11 o 554 


■ A nco 
U.Uoo 


n 001 t 


9 ' 


3 t 


Creatine Kinase B chain (brain) 


173 724 


0.508 


I.14Z j 


CQ 

oy 


10 1 


Cytokeratin 8 


OXO /ICO 

242 463 


U.lo4 


n QQQ t 

u.oyo j 


oo 


An t 


CytoskeletaJ tropomyosin, NTRK1 


85844 


0.414 


1.174 | 


1 A 
14 


1fi t 

IO i 


Galactocerebrosidase 


273 


0.095 


A yinc + 

U.4UO y . 


Q 
O 


o — . 


Lactoyl Glutathione Lyase 


75 207 


0 


< occ t- 
1 .Zoo I 


07 

- d( 




Myosin Light Chain (1000) 


77385 


0.724 


6.102 f 


H H O 
1 IO 


oo j 


Myosin Light Chain (1001) 


77385 


0.6 


3.212 t 


118 


oo x 


Myosin Light Chain (1 032) 


77385 


2.573 


&694 - \ . 


1 1 O 


oo j. . 


Nm23 


118638 


0.583 


1.978 t 


35 


oc 1 

do x 


Protein phosphatase 6, catalytic 


279563 


0.287 


0.737 T 


2 


2 = 


Protein 


7016 


0 


0.448 t 


!2 


5 T 


Slow Skeletal Muscle Troponin 


84673 


0.067 


0.195 t 


1 


o i 


SRY male gonadal sex determining 


1992 


0 


0.735 T 


ND 


ND 


Synovial sarcoma, X breakpoint 2 


289105 


0.225 


0.593 f 


ND 


tort 

ND 


Terminal deoxynucleotidyl 


272537 


0.392 


0.995 t 


ND 


ND 


Transformer-2 alpha 


119523 


0.074 


0.201 t 


7 


13 T 


284 




0.173 


0.461 t 


* 




402 




0 


0.33 t 






456 




0.154 


0.314 T 






664 




0.297 


0.629 t 






807 




0 


0.377 T 






863 




0.246 


0.665 j 






903 




0.271 


0.632 t 






1018 




0 


0.725 | 






1054 




0.47 


1 .394 T 






1057 




0.451 


0.959 t 






1082 




0.609 . 


1.738 T 






14-3-3 eta 


75544 


0.232 


0.114 1" 


1 


5 T 


tjeia- l ,*f-yEuacxosyi iransierai>c 




1.306 


0.707 I 


NO 


ND 


Cox11 


241515 


1.643 


1.453 i 


3 


0 I 


Dihydrolipoamide dehydrogenase 


74635 


0.844 


0.132 i 


9 


8 i 


G3PD(456) 


169476 


1.191 


0.279 1 


75 


137 | 


G3PD (457) 


169479 


2.051 


0.336 1 


75 


137 T 


Selenium Binding Protein (234) 


7833 


0.411 


0.201 1 


26 


31 T 


Selenium Binding Protein (237) 


7833 


0&74 


0.22 i 


26 


31 t 


651 




0.31 \ 


0.115 i 






745 




0.21 


0.067 i 






892 




0.674 


0.454 i 






1178 




0.933 


0.305 i 







Numbers in protein are the intensity of protein spots 

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 

|: increase; J: decrease; =: equal 
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When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCMH1 5 times and 0 times in the corresponding sam- 
ples. Therefore, the SAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PGR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaked between 6-20 h, and gradu- 
ally declined 20 h post-treatment of DHT. Since the SAGE 
experiment was done in samples treated for 24 h with 
DHT, PSA mRNA level was likely to have declined from 
its peak level. Indeed, we could detect 5-10 fold induction 
of PSA at 6-8 h post-treatment. Clusterin was reported as 
an androgen-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli [24]. Our results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1 C). 

Another important question is how many of the andro- 
gen-regulated genes identified are directly induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we will divide the androgen-regulated 
genes identified in this study into two groups; the cyclo- 
hexamide-sensitive group whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor and the cyclohexamide-insensitive group whose 
induction or reduction is not affected by cyclohexamide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cyclohexamide-insensitive. Experiments are under 
way to determine the cyclohexamide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some clues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited sensitivity of 2-D 
PAGE. Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain speci- 
fic creatine kinase, SRY protein, B23 and G3PD. Using 
a subtractive approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down-regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [19]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25]. Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18], Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in cu- list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and selenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RNA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translations 
control, post-translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a lag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 

We would like to thank Dr, Ken Kinzler (Johns Hopkins 
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Based on biomedical literature databases, we tried a first step'fofcohstructing a gene expression ReceiveaTDecember 27, 2004 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- Revised: March 26, 2005 

scriptomic research were available from 12 studies, using various technologies, namely, SAGE, Accepted: March 29, 2005 

cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
cell lines and nine studies of human samples. The total number of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS- or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale transcriptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 

Keywords: 

Colon neoplasms / Rectal neoplasms / Transcriptomics 



1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome- 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 
analysis of single experiments but from libraries of experi- 
ments. In other words, the results obtained so for by trans- 
lational research tools in different clinical and experimental 
settings need to be compared, contrasted, and if possible 
synthesized. Thus, based on the biomedical literature data- 
bases, we tried a first step for constructing such "data ware- 
house" specific to human CRC. 

t 
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The specific aims of die present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of " meta-analysis of gene expression" 
in CRC. Our hypothesis was that we would be able to deter- 
mine a common set of genes and gene products that are up- 
or down-regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer* 
and "gene expression'' with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was peiformed on appropriate abstracts. 
Articles were selected by a consensus of two reviewers (E.S., 
M.R.) that satisfied these predetermined criteria: sample ori- 
gin (human and/or cell lines) and preparation detailed, tech- 
nology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on over- 
expression or underexpression available. Studies concerning 
single genes or arbitrarily selected genes were discarded. 
. Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seatde, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide trans criptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor-tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 144. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these studies is provided in. Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free-flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 



3 Results 

One thousand two-hundred and forty genes have been 
reported to be deregulated (up- and/or down-regulated) in 
human CRC, representing about 5% of the 20000-25 000 
human genes. 



* The complete data set can be consulted: 
http:/ /www.chirurgiebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 144 patients 

Author Year Sample Preparation Method Number 

, of patients 



Zhang era/. [18] 1997 

SAGE-NET [191 1997 

Zhang eta I. [18) 1997 
Parle-McDer m ottetalA 20] 2000 

Yanagawa etal. [21] 2001 

Takernasa et al. [22] 2001 

Nottermann etal. [23] 2001 

Buckhaults et al. [24] 2001 

Birkenkamp etal. [25] 2002 

Lin etat. [26] 2002 

Muroefa/.[27J 2003 

Williams etal. [28] 2003 



Normal mucosa and primary tumors 
Primary tumors and cell lines 
Cell lines 
Cell lines 

Paired normal and cancer 

Paired normal and primary tumors 

Paired normal mucosa and primary ' 

tumors, or adenoma 
Normal mucosa, adenoma and 

i primary tumors 
Paired normal and primary tumors 
Paired normal mucosa and primary 

tumors, or adenoma 
Normal and cancer 
Paired normal mucosa and primary 

tumors, normal and adenoma 



Whole tissue 
Cell lines 
Cell lines 
Cell lines 
Whole tissue 
Whole tissue 
Whole tissue 

Purified epithelial 

cells 
Whole tissue 
Purified epithelial 

cells 
Whole tissue 
Whole tissue 



SAGE 2 

SAGE 

SAGE 

SAGE 

cDNA 20 
cDNA 20 
Otigonuc- 22 

leotrde array 
SAGE 6 



cDNA 
cDNA 



27 
16 



Adaptor-tagged 11 
cDNA 20 
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Table 2. Large-scale or proteomics studies in human CRC, using cell lines or human tissues. All studies together 



involved only nine patients 


Author 


Publication 


Sample 


Method 


Number 




year 






of patients 


Reymond eta/. (9| 


1997 


Purified epithelial cells 


2-D PAGE 


1 


Simpson etal. [29 1 


2000 


Cell lines 


SDS-PAGE 




Lawrie era/. [30) 


2001 


Purified epithelial cells 


2-D PAGE 


4 


Simpson etal. [3D 


2001 


Cell lines 


Free-flow electrophoresis 




Medjahed eta/. [32] 


2003 


Insitico 


2-D PAGE 




Demalteefa/. [33] 


2003 


Cell lines 


2-D PAGE 




Stierum etaL [341 


2003 


Cell lines 


2-D PAGE 




Friedman etal, [351 


2004 


Human whole tissue 


2-D PAGE 


6 



The vast majority of deregulated genes in human CRC 
was found using transcriptomics tools: a total of 982-genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study, in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 (up-reg* 
ulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. Out of these 408 
molecules, 339 (83%) were mentioned in a single study, in 
other words these results could not be reproduced. Differential 
expression of the remaining 70 proteins were mentioned with 
following frequenc: 40 were mentioned in two proteomics 
studies, 16 in three studies, 1 0 in four studies, 3 in five studies, 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
(1) provided differential display protein expression data 
obtained in the human patient, using whole tissue biopsy. 

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differ- 
entially expressed in human CRC by .genome-wide tran- 
scriptomics technologies (Table 6a), only 177 (18%) have 
been confirmed using proteome technologies. When the 
genes reported to be differentially expressed in three and 
more transcriptomics studies (it =31.) are compared with 
those reported to be differentially expressed in three and 
more proteomics studies (n = 30), only two genes (actin: 
ACTELHUMAN and creatin-kinase: KCRB_HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptome level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when prote- 
omics results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistently (in three studies or more) reported to be dysreg- 
ulated in CRC is compared with transcriptomics results, 20 



genes can be matched, representing about two-thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%). However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regu- 
lated that obviously do not play a causal role in CRC carci- 
nogenesis. 



4 Discussion 

This study was aimed at screening transcriptomics and pro- 
teomics studies published in CRC, and to elaborate a kind of 
meta-analysis of gene expression in CRC in order to generate 
a data warehouse that would be useful in translational 
research. A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of array- 
based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down-regulated in human CRC by comparing results 
obtained by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteome and transcriptome level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be pro- 
vided to explain this lack of reproducibility. 

First, the number of patients included in the studies is 
low (alltogether 144 patients in transcriptomics, 11 in prote- 
omics studies); some high impact publications having been 
conducted on samples of only two patients [18J. This is a 
problem because interindividual genetic variability is high in 
human CRC (2]. Thus, it is allowed to hypothesize that some 
results attributed to gene dysregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptomics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 



Accession 

Number Swiss-Prot Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


Lin et al 2002 [26] 


I 

§ 

© 
Q 

o 

s 

i 

m 

CL 


Zhang et al 1997 [18] 


SAGE-NET et all 997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


Paired normal and cancer 


Normal and cancer 


E 

tH 
CL 
TO 
C 
<0 
A 

E 
o 
c 

* 
"O 

«l 

o 
zs 

Z 


Paired n. and pr.tumors 


Paired n. and pr.tumors! 


Paired n.muc and pr.tum or adenoma 


Cell lines ! 


| 

*o 

e: 

CO 

o 
=1 
E 

i 

o 
Z 


Primary tumors and cell lines 


Cell tine 


Paired n.muc and pr.tum.n and adenoma 


Paired rijituc and pr.tum or adenoma 


a 
u 


Adaptor-tagged 


LU 

CO 


cDNA 


< 
z 

? 


< 
z 
o 

u 


SAGE 


SAGE 


SAGE 


SAGE 


2 

a 

o 


Oligonucleotide Array 


AF041260 


075363 


IFM2.HUMAN 


Interferon-inducible protein 1-Bd 


1 




1 




1 






1 










AA232508 


Q9Y397 


SAHH__HUMAN 


S-Adenosylhomocysteine hydrolase AHCY) 






1 


1 










1 


1 






AA007218 


Q96QY8 


BGH3_HUMAN 


Transforming growth factor, beta-induzed, 66 kDa 


1 




1 


1 










1 








6011 1 9 






IChaperonin containing TCP-1, subunit4 (delta) 












1 








1 


I 




X75821 


Q04984 


CAtl.HUMAN 


C0L1A1 










1 


1 




1 










U22055 


Q96AG0 


TF3A_HUMAN 


General transcription factor II IA 








1 




1 










1 




U14631 


P80365 


GROJHUMAN 


GR0I oncogene (melanoma growth stimulating 
activity, alpha) 




















1 


1 




M92439 


P42704 


MDP1.HUMAN 


MPDP4, MDP7 microsomal dipeptidase 






1 


1 


















AF029082 


P31947 




M-phase inducer phosphatase 2 






1 




















M86400 


P29312 




Pyrroline 5-carboxilate reductase mRNA . 






1 




















L76465 


P15428 


SPRC.HUMAN 


Secreted protein, acidic„cysteine-rich 
(osteonectin) \ 






1 




















X57352 


Q01628 


Q96AG0 


100 kDa coactivatormRNrA : 


























X03205 


P62888 


RL30.HUMAN 


SOS Ribosomal protein L30* . 


























M95787 


Q01995 


RL23_HUMAN 


60s ribosomal protein L23 


























Z25821 


P42126 


HS9B_HUMAN 


90-kDa HSP 


























U96132 


Q99714 


RLA2.HUMAN 


Acidic ribosomal phosphoprotein P2 mRNA 


























AB028893 


P04643 


CA14.HUMAN 


Alpha-1 chain of collagen IV 


























X79239 . 


Q02546 


APA1_HUMAN 


Apo lipoprotein A-l 


























M13934 


P06366 


BMP4JIUMAN 


Bone morphogenetic protein-2B (BMP-2B) 


























M60854 


P17008; 


CSE1_HUMAN 


CAS chromosome segregation gene homologue 


























M13932 


P08708 


CD81_HUMAN 


CD81 antigen 


























X69150 


P25232 


TCPZ_HUMAN . 


Chaperonin containing TCPl, subunit 6A(zeta1) 


























L06498 


P17075 


CA21.HUMAN 


Collagen alpha-2 type 1 
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Table 3. Continued 



TRANSCRIPTOMICS 



Accession 

Number Swiss-Prot Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


I 

« 

CL 

E 

re 

c 
S 


§ 

re 
Z 
5 

| 


i 

To 

% 
c 


I 

e 

§ 

« 

a 

o 

£ 

i 

a 


Zhang et al 1997 [18] 


5? 

I 

z 

ut 
XS 
< 
CO 


CO 

I 

e 

13 

-e 

IM 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


o 

o 


Adaptor-tagged 


LU 

cs 

S5 


< . 

Z 

o 

u 


< 

SE 
D 

o 


< 

Q 

o 


U> 
< 


C9 

S 


C9 
< 
CO 


tea 


-se- 
as 
o 
o 


Oligonucleotide Array 


L04483 


P35265 


:02 HUMAN 


Complement component C2 


1 


1 






















M64716 


P25111 


R10AJWMAN 


Csa-19mRNA 


-j- 


1 






















X77770 


P02383 


R0A1 HUMAN 


DNA binding protein UPI, liver mRNA fragment 




1 






















W52460 


P14798 


TP2A_HUMAN 


DNA topoisomerase II |top2) 


1 


J 






















X557I5 


P23396 


IF39.HUMAN 


Eukaryotic translation inrtialion factor(EIF3)mRNA 


























M77234 


P49241 


IF2B.HUMAN 


Eukaryotic translation initiation factor 2, subunit 2 




1 






















M58458 


P12750 


GGH.HUMAN 


Gamma- giytamyl hydrolase 


























D16992 


P10660 


G3P2 HUMAN 


Glycer aldehyde 3 phosphate dehydrogenase 


























M77233 


P23821 


SYG_HUMAN 


Glycyl-tRNA synthetase 


























F16294 


PO9058 


RS8_HUMAN 


H19RNA 


























X61156 


P08865 


HS9AJWMAN 


Heat shock Protein HSP 90-Alpha 


























U605S 


Q13541 


HMGLHUMAN 


Hmgl mRNA for high mobility group protein 1 


~- 
























AA316619 


P04645 


RS23.HUMAN 


Human homolog of yeast ribosomal protein S23 
(D14530) 


























LI 4599 


Q15233 


IMD2.HUMAN 


IMP (inosine monophosphate) dehydrogenase 2 


























X79234 


P39026 


IFM1_HUMAN 


Interferon induced transmembrane protein 1(9- 
27} 


























L06505 


P30050 


LOHB.HUMAN 


Lactate dehydrogenase B(LDH-B) 


























X64707 


P26373 


RS2 HUMAN 


LLRep 3 


























X56932 


P40429 


4F2.HUMAN 


Lymphocyte activation antigen 4F2 large subunit 


























L25899 


P39030 


NPL1_HUMAN 


Nucleosome assembly protein 1-like 1 


























X63527 


PH118 


GSHH.HUMAN 


Phospholipid hydroperoxide glutathione per- 
oxidase 


























X89401 


P46778 


GDFF.HUMAN 


Prostate drfferentation factor 


























H59771 


P23131 


RL6.HUMAN 


Ribosomal protein L 6 


























M94314 


P38663 


RL18_HUMAN 


Ribosomal protein L18(RPL18) mRNA 


























LI 9527 


P08526 


RL1X_HUMAN 


Ribosomal protein Ll8a mRNA, complete cds 


























U14968 


P46776 


RL7.HUMAN 


Ribosomal protein L7 


























U14969 


P46779 


RL8.HUMAN 


Ribosomal protein L8 


























L38941 


P49207 


RS5_HUMAN 


Ribosomal protein S5 


























U12465 


P42766 


RSt9_HUMAN 


S 19 ribosomal protein 


























F19234 


P18077 


MYCJWMAN 


V-myc myelocytomatosis viral oncogene 
homolog (avain) 


























X66699 


PI 2751 


Q99497 


EST(PARK7; DJ1) 


1 


1 























) 
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Table 4. Results of transcriptomics research in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 



Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takeinasa et al 2001 [22] 


§ 

ra 

c' 


Parle-McDermott 2000 [20] 


CO 

"3 

% 

• a> 
s 

(9 


SAGE-NET etal 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


| Paired normal and cancer 


Normal and cancer 


E 

3 

43 

a. 

*o 
c 

CD 

<s 
E 
o 
e 

Ol 
TO 

CS 

o 

3 

Z -' 


Paired n. and pr.tumon 


Paired n. and pr.tumorc 

t 


Paired n.muc and pr.tuin or adenoma 


Cell lines 


I 
£ 

-o 

e 

« . 
o 

1 
i 

o 
Z 


Primary tumors and cell lines 


Cell line 


Paired n.muc and prtum.n and adenoma 


Paired n.muc and pr.tum or adenoma 


< . 
z 

o 


Adaptor-tagged 


Ui 

- iS 
< 


< 
z 
o 

o 


< 
z 

a 


< 
z 

o 


SAGE 


< 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


CAH2_HUMAN 


Carbonic anhydrase II 


t 




1 




1 








1 




1 


1 


CEA1.HUMAN 


EST (BILIARY GLYCOPROTEIN) 






1 


1 


1 








1 




1 


1 


BENE_HUMAW 


BENE 


1 








1 






t 








1 


CAH1_HUMAN 


Carbonic anhydrase 1 


1 




1 












1 






1 


GUAU_HUMAN 


GCAP-II guanytate cyclase activator 2B; Uroguanylin; UGN 


t 
















1 






1 


FABL_HUMAN 


L-FABP liver fatty acid-binding protein 1 


1 








1 








1 








ACDS.HUMAN 


Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 






















1 


1 


ADHG_HUMAN 


ADH gama 2 subunrt (a a- 1-37 5) 










1 














1 


T4S3_HUMAN 


Co -029; transmembrane 4 superf amity member 3 


















1 








DRA_HUMAN 


Colon mucosa-associated |D RA} 


















1 








KCRB.HUMAN 


Creatine kinase 


1 
















1 








GUAN_HUMAN 


GUCA1B guanulate cyclase activator IB/guanylin 


















1 






1 


095784 


IgG Fc-binding protein 


1 








1 
















ITMC.HUMAN 


Integral membrane protein 2C 










1 


1 






1 








MK03_HUMAN 


MAPK3 


















1 




1 




FXY3.HUMAN 


MAT8 protein 










1 








1 








ACTB.HUMAN 


mRNA fragment encoding cytoplasmic actin 


1 












1 


i 










SELP_HUMAN 


Selenoprotein P 










1 










1 






EST1_HUMAN 


Liver carboxylesterase 1 [Precursor] 


















1 


1 




1 


CFAD.HUMAN 


Adipsin, complement factor DCDA (EST) 
























1 


ATPB.HUMAN 


ATP5B ATP synthase 










1 
















CAHC_HUMAM 


CA12 










1 
















CANS.HUMAN 


Calcium dependent protease (small subunit) 
















i 










CALM.HUMAN 


Calmodulin-I (CALM1 ) mRNA, 3' UTR, partial sequence 






















1 




CAH4_HUMAN 


Carbonic anhydrase IV ; 








1 
















1 
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Table 4. Continued 

TRANSCRIPTOMICS 



Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


1 

"5 

Q> 
C 


I 

§ 

s 

i 

CD 

o. 


So 

at 

"to 

% 

a» 
c 
2 

N 


57 

£ 

15 
3 

fc 
z 

to 


Zhang et al 1997 [18] 


Williams et al 2003 [28] | 


Notterman et al 2001 [23] 


z 

Q 


Adaptor-tagged 


SAGE 


< 
z 
o 

o 


< 
S 

o 


< 

Z ' 
Q 
U 


SAGE 


Ul 

< 


LU 

< 

CO 


LU 
< 


cDNA 


Oligonucleotide Array 


MT1E.HUMAN 


cDNA similar to gb: M10942__cdsl human 


















1 






1 


000748 


CES 2 carboxylesterase 2 








1 


1 
















CMGA.HUMAN 


CgA 








1 
















t 


CLUS.HUMAN 


clusterin 
























1 


KlCT.HUMAN 


Cytokeratin 20 


















1 






1 


KDGAJWMAN 


Diacylglycerol kinase 








1 
















1 


DTD.HUMAN 


DTD sulfate transporter 










1 
















PLA8 HUMAN 


EST 122594 5_ 


















1 






1 


ATPA_HUMAN 


FI-ATPase alpha subunh 








1 


1 
















LE63 HUMAN 


Galectin-3 (Galactose-specific lectin 3) (MAC-2 antigen) 








1- 


1 
















ABP_HUMAW 


HP-DA01 (diamine oxidase) 




















i 




1 


X3CS.HU MAN 


Keratin 19 












1 








I 






K2C8..HUMAN 


Keratin, type II cytoskeletal 8 (cytokeratin 8) (K 8) (CK 8) 














1 






1 






DHB2_HUMAN 


L1 1708 Estradiol-17 beta- dehydrogenase 2 












1 












1 


MT1H HUMAN 


Metallothionein 1H 
























\ 


MUC2.HUMAN 


Mucin 2, intesonalftracheal 














1 


1 










MEP A_HUM AN 


PPH alpha gene 




















1 




1 


PA2/LHUMAN 


RASF-A PIA2 gene 












1 










1 




MT1F.HUMAN 


RNA helicase-related protein 
























1 


SBP1_HUMAN 


SBP selenium-binding protein 












1 


1 












MT1LJUJMAN 


Serine theonine kinase 39 (STE20/SPS1 homolog,yeast) 
























1 


MYL6 HUMAN 


Myosin light polypeptide 6 


















1 






TETN.HUMAN 


TNA tetranectin 












1 












1 


VIPRJ1UMAN 


VIPR1 vasoactive intestinal polyoeptide receptor 1 












1 












1 


PLA8.HUMAN 


Placenta-specific gene 8 protein 




















1 




1 


Q9NXM9 


Hypothetical protein FU20151 
























1 


T4S1_HUMAN 


Transmembrane 4 superfamtly member 1 








1 












1 






ALU2_HUMAN 


Alu subfamily SB sequence contamination warning entry 






1 






1 














S116_HUMAN 


S100 calcium-binding protein A16 


















1 


1 






C14A.HUMAN 


Dual specificity protein phosphatase CDC14A 




















1 


1 





Second, results have been obtained using heterogeneous 
samples in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
obtained in cell lines do not allow accurate comparison be- 
tween normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 
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colonic crypts were compared with the CRC cell line LIM 
1863, the proteins spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [3]. When clin- 
ical biopsies are examined, results of phenotypic compare 
isons depend on the type of samples examined (eg., hetero- 
geneous whole tissue biopsies with inflammatory cells, 
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Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow) 
proteins out of 12 large-scale proteomics studies 



Accession 
Number 



Swiss-Prot Entry Name 



Name 



g _ 

f7 1 r% t SL S* S *t 



3 



AF041260 



AA232508 



075363 



ACTB_HUMAN InRNA fragment encoding cytoplasmic actin 



Q9Y397 



ACTG_HUMAN Actin, Cytoplasmatic 2 (Gamma-Actin) 




AA007218 



Q36QY8 



6TP_HUMAN [Glutathione S-transferase M3 (brain) 



i X77584 



601119 



THI0.HUMAN [Thioredoxin (ATL-derived factor) (ADR 



X75821 



U22055 



Q04984 



ANX3_HUMAM I Annexin Ml (Lipocortin III) 



Q96AG0 



ANX4_HUMAN [Annexin IV (Lipocortin IV) 



U14631 



M92439 



P80365 



K1CR_HUMAN j Keratigtype I cytoskeletal 18 (cytokeratin 18)(K18) |CK18) 



P427Q4 



ATPB J1UMAN I ATP5B ATP synthase 



AF029082 



P31947 



PPIAJJUMAN | Peptidyl- Prolyl Cis-Trans Isomerase A (Cyclophilin A) 



M86400 



P29312 



POA3_HUMAN jProbable protein disulfide isomerase Eft-60 precursor 



L76465 



PI 5428 



TPIS_HUMAN jTriosephosphate isomerase (TIM) 



X57352 



Q01628 



TCTPJJUMAN jTranslationalry controlled tumor protein (TCTP)(p23) 



X03205 



PI 0809 



CH60_HUMAN j Heat shock protein 60 



M95787 



Q01995 



VINC HUMAN IVinculin 



EF1G_HUMAN ""[Elongation factor 1-gamma mRNA 



Z25821 



U96132 



A8028893 



X79239 



P42126 



Q99714 



TBA1_HUMAN 1 Tubulin alpha-1 chain, brain specific 



P04643 



CATP.HUMAN ICathepsin P (EC 3A23..5).{GENE.CTSD}Homo sapiens 



Q02546 



EFt INHUMAN Elongation factor 1 -alpha, mRNA 



M 13934 



MG0854 



P06366 



G3P2_HUMAN [Glyceraldehyde 3 phosphate dehydrogenase 



P17008 



GR78_HUMAN !78kDa glucose-regulated protein precursor (GRP 78) 



3- 



M 13932 



P08708 



EN0A_HUMAN {Alpha enolase (2-phospho-D-grycerate hydrolase) 



X69150 



L06498 



104483 



P25232 



CRTC.HUMAN | Calrecticulum precursor (CR55) (calregulin) 



mm 



PI 7075 



COF1_HUMAN jCofilin, non-muscle isoform 



1:> 



P35265 



HS7C_HUMAN Irleat shock Cognate 71 kDa protein 



M64716 



P25111 



HBB.HUMAN ! Hemoglobin beta chain.{GENE: HBBjh.s 



X77770 



P02383 



K2C8_HUMAN I Keratin, type II cytoskeletal B (cytokeratin 8) (K 8) (CK 8) 



W52460 



PI 4798 



RM12_HUMAN j Mitochondrial 60s ribosomal protein L7/I12 precursor 



mm 



X55715 



P23396 



GR75_HUMAN Mitochondrial stress-70 protein precursor (GRP 75) 



M77234 



M58458 



P49241 



S0DC_HUMAN (Superoxide Dismutase (Cu-Zn) 



mm 



PI 2750 



KCRB_HUMAN Creatine kinase 




01 6992 



PI 0660 



143ZJ1UMAN 1 14-3-3 protein zeta/delta (protein kinase C inhibitor protein- 1) 



M77233 



P23821 



RL4_HUMAN |60s ribosomal protein L4 (L1) 



F16294 



P09058 



ANXt HUMAN 



j Annexin I (lipocortin I) (calpactinll) (chromobindin 9) (p35) 



X61156 



L36055 



P08865 



ROA1_HUMAN | DNA binding protein UPI, liver mRNA fragment 



Q13541 



FAB INHUMAN il-FABP liver fatty acid-binding protein 1 
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Table 5. Continued 



Accession 

Number Swiss-Prot Entry Name Name 


Simpson et al 2000 [29] 


Simpson et al 2001 [31] 


Oemalte et al 2003 [33] 


Reymond et al 1999 [9] 


Medjahed et al 2003 [32] 


Lawrie et al 2001 [30] 


Friedman et al 2004(35] 


1 

15 


UI 

ca 
<C 

CO 

o 

CO 


Free-flow electrophoresis 


2-0 PAGE 


2-0 PAGE 


2-0 PAGE 


ui 

s 

a 


UI 

% 

a 

eil 


Ui 

S 

a. 
o 

tit 


AA316619 


P04645 


PAB1JIUMAN 


Poly( A) binding protein, mRNA 


m 
















LI 4599 


Q15233 


RL3_HUMAN 


Ribosomal protein L 3 


m 














X79234 


P39026 


RS19.HUMAN 


S 19 ribosomal protein. 


11 














L06505 


P30050 


SAHH.HUMAN 


S-Adenosyfhomocysteine hydrolase AHCY) 




W$ 














X64707 


P26373 


TERA_HUMAN 


Transitional endoplasmic reticulum ATPase (TER ATPase) 


i| 














X56932 


P40429 


GDIR.HUMAN 


Rho GDP-dissociatioh inhibitor 1 














mm 


L25899 


P3903O 


ACONJIUMAN 


Aconitate hydratase, mitochondrial precursor (aconitase) 


m 
















X63527 


P14118 


AKA1_HUMAN 


Alcohol dehydrogenase (NADP+) (aldehyde reductase) 


m 
















X89401 


P46778 


AAC4.HUMAN 


Alpha-Actinin 4 


M 
















H5977I 


P23131 


NPMJUJMAN 


B23 nucleophosmin 


m 




i" 












M 943 14 


P38663 


£ATA_HUMAN 


Catalase 


















LI 9527 


P08526 


CD44.HUMAN 


CD44 antigen (Phagocytic glycoprotein 1) 






i 












U14968 


P46776 


TCPZ_HUMAN 


Chaperonin containing TCP1, subunit 6A(zeta1) 






m 












U14969 


P46779 


COXA.HUMAN 


Cytochrome c oxidase polypepti-de Va, mitochondrial (EC 1.9.3.1) 
(GENE:COX5A}H.s 






-M 












L38941 


P49207 


C0XB_HUMAN 


Cytochrome C oxidase polypeptide VB precursor.COXBB 


?M 
















U 12465 


P42766 


EF1B_HUMAN 


Elongation factor 1-beta (EF-1-beta) 






%i 












F19234 


PI 8077 


ER29.HUMAN 


Endoplasmatic reticulum protein ERp29 (ERp31) (ERp28). 






\M 












X66699 


PI 2751 


EZRLHUMAN 


Ezrin (p81) (cytovillin)(villin-2) 


















D23660 


P36578 


ALFA_HUMAN 


Fructose-Biphosphate Aldolase A (Muscle-Type Aldolase) 


















U 14966 


P46777 


DHE3_HUMAN 


Glutamate dehydrogenase 1 precursor (GDH) 


















X06705 


P11518 


HS9AJWMAN 


Heat shock Protein HSP 90 -Alpha 


















D14531 


P32969 


R0LHUMAN 


Heterogeneous nuclear ribonucleoprotein L (HNRPN 1) 


















U30255 


P52209 


K2C1_HUMAN 


Keratin, type II cytoskeletal 1 (cytokeratin 1) (K1) (CK1) 




m 














M81182 


P28288 


PDXLHUMAN 


Peroxiredoxin 1 (EC 1.1 1.1.-) (Thioredoxin peroxidase 2) 






T 












U04627 


P40939 


060506 jPRM RNA binding protein Gry-rbp ( GRY-RBP)mRNA 


















X87949 


PI 1021 


PHBJWMAN | Prohibrtin 


















M16660 IP08238 


PDLHUMAN 1 Protein disulfide isomerase precursor (PDI) 






= & 












U79725 IQ99795 


SPCB_HUMAN (Spectrin beta chain, erythrocyte (Beta-I spectrin) 






i.1-:. 












D10511 IP24752 


PDX3_HUMAN jThioredoxin-dependent peroxide reductase, mitochondrial 






: 1. 












M55040 IP22303 


TPM3_HUMAN j Tropomyosin alpha 3 chain (T ropomyosin 3) 






• i' 






11 






AF102542 1 095395 


NDKA_HUMAN | Non-metastatic cells 1, protein (NM23A)expressed in 


















W45148 ! P24666 


143S_HUMAN 1 14-3-3 PROTEIN SIGMA 






;T- 












Ml 7885 i P05388 


RL24_HUMAIM !60s ribosomal protein L24IL30} 


















M17886 I P05386 


APA1 jHUMAN ! Apolipoprotein A-l 






i 












M17887 IP05387 


DHSA_HUMAN jS0H2 succinate dehydrogenase flavoprotein subunit 






1 









} t 
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Table 6. Reproducibility of transcriptome results using proteomics tools (a) and vice versa (b). Proteomics results have been reproduced in 
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics 
tools (25%). Only two gene products can be retrieved in three and more proteomics and transcriptomics studies (actin and creatin- 
kinase) 

a. Transcriptomics > Proteomics 



Number " Swiss-Prot Entry Name 




AFD41260 



075363 



CAH2.HUMAN 



Carbonic anhydrasa II 



AA232508 



Q9Y397 



CEA1_HUMAN 



EST (BILIARY GLYCOPROTEIN) 



AA007218 



Q96QYB 



BENE_HUMAN 



BENE 



X05014 



G01119 



CAH1.HUMAN 



Carbonic anhydrase I 



X75821 



Q04984 



GUAU_HUMAN 



GCAP-II guanytate cyclase activator 2B; Uroguanylin; UGN 



U220S5 



IFM2_HUMAN 



Interferon-inducible protein 1-Bd 



U14631 



FABL.HUMAN 



l-FABP liver fatty acid-binding protein I 



M92439 



P42704 



SAHH_HUMAN 



S-AdBnosylhorno cysteine hydrolase AHCY) 



ARE 9082 



P31947 



BGH3_HUMAN 



Transforming growth factor, beta-induced, 66 kOa 



M86400 



P29312 



ACDS_HUMAN 



Acyt-Co enzyme A dehydrogenase C-2 to C-3 short chain 



L76465 



P15428 



ADHG.HUMAN 



ADH go ma 2 subunit (aa-1-375) 



X57352 



001678 



CC16.HUMAN 



A PCS or CDC16Hs cell division cycle 16 



X03205 



P50991 



TCPD.HUMAN 



Chaperonin containing TCP- 1, subunit 4 (delta) 



M957B7 



Q01995 



T4S3.HUMAN 



Co-azSttransmembrane 4 superfamily member 3 



Z25821 



P42126 



CAII.HUMAN 



C0L1A1 



U96132 



Q99714 



DRAJJUMAN 



Colon mucosa- associated (DRA) 



ABQ28S93 



P04643 



KCRB_HUMAN 



Creatine kinase 



X79239 



QOZ546 



TF3A.HUMAN 



General transcription factor HI A 



M13934 



P06368 



GRO.HUMAN 



GR01 oncogene! melanoma growth stimulating activity, alpha! 



M60854 



P17008 



GUAN_HUMAN 



GUCA1B guanulate cyclase activator iB/guanylin ' . 



M 13932 



P08708 



095784 



IgG Fc-btnding protein 



X69150 



P25232 



L06498 



P17075 



L04483 



P35265 



M 6471 6 



P25111 



X77770 



P02383 



Integral membrane protein 2C 



MAPK3 



MATS protein 



MPDP4, MDP7 microsomal dipeptidsse 



M- phase inducer phosphatase 2 



W52460 



P14798 



X55715 



P23396 



M77234 



P49241 



M58458 



P12750 



016992 



P10660 



ACTB.HUMAN 



PRpC.HUMAN 



SPRC_HUMAN 



SELP^HUMAN 



EST1_HUMAN 



mRNA fragment encoding cytoplasmic actin 



Pyrroline 5-carboxilate reductase mflNA 



Secreted protein.acidic. cysteine-rich (osteonectin) 



Selenoprotein P 



EST 
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necrosis, blood, stool, etc.), so that standardized sample 
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) [4], laser capture microdissection (LCM) [5-8], 
immunomagnetic beads separation [9], and cellular frac- 
tionation [10]. Unfortunately, these sample preparation 
procedures were barely applied in CRC. In our experience, 
the beads-based method is characterized by several advan- 
tages when compared with other cell purification proce- 
dures [11]. 

Third, gene expression patterns depend on the arrays 
technology platform. In transcriptomics studies any factors 
may affect the outcome of a microarray experiment, in par- 
ticular technical, instrumental, computational, and inter- 
pretative factors. In fact, lack of reproducibility and accuracy 
is a major^cpneern in microarray studie^[T2]. Whe n cross-, 
platform comparison was performed, reproducibility was 
insuflBcient only four genes from a set of 185 common genes 
selected behaved consistently on three array platforms, and 
agreement of about 30% was found between two brands [13]. 

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CV of repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (interrun) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment Interindividual phenptypic variability was high. 
Thus, even under well-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 

Fifth, the methods applied for generating, formatting, 
storing, retrieving, and querying data are of outmost impor- 
tance to assess methodological and biological variation in 
gene expression analysis. Unfortunately, due to the small 
sample size (number of patients), large number of variables 
examined at once, and absence of double or triple experi- 
ments (arrays and gels are expensive and samples are rare) 
statistical analysis is often not valid. In particular, assessing 
the reproducibility of a variable is necessary (eg., using the 
intraclass correlation coefficient) for comparing multiple 
samples at once. The use of median values instead of mean 
values has been shown to improve data correction [14]. It has 
also been proposed to use housekeeping genes as endoge- 
nous controls [15]. A dedicated society, The Microarray Gene 
Expression Data (MGED) Society, has been formed to facil- 
itate the sharing of gene expression data generated by func- 
tional genomics and proteomics experiments [16]. 

Finally, correlation between results of transcriptomics 
versus proteomics results is low. For CRC, there is no pub- 
lication comparing mRNA and protein expression for a 
cohort of genes. However, extrapolation is .reasonable from 
another epithelial cancer (lung adenocarcinoma), where 
such comparison has been performed. Only a subset of the 
proteins (17%) exhibited a significant correlation with 
mRNA abundance [17]. 
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Obviously, many genes and factors found to be differen- 
tially regulated (both in transmptomics and proteomics 
studies) do not play a causal role in CRC carcinogenesis. For 
example in the studies under investigation, at least 17 
mRNAs encoding ribosomal proteins were identified to be 
dysregulated using cDNA arrays and 39 ESTs using SAGE 
technology. This broad dysfunction of protein synthesis, in 
particular of small molecules synthesis, has been reported 
not only in cancer but also in several other human diseases, 
where etiologies have been linked to mutations in genes of 
the translational control machinery [36]. 

However, some findings might be of particular interest 
in human CRC. For example, a small molecule group found 
to be dysregulated in human CRC is the 14-3-3 proteins 
family. 14-3-3 proteins are ubiquitous within all eukaryotic 
cells and participate in protein kinase signaling pathways. In 
particular, they are involved in phosphorylation-dependei?.t 
protein-protein interactions that control progression 
through the cell cycle, initiation and maintenance of DNA 
damage checkpoints, activation of MAP kinases, prevention 
of apoptosis, and coordination of integrin signaling and 
cytoskeletal dynamics [37]. Given the prevalence of specific 
14-3-3 isoforms expression in several human epithelial can- 
cers [38, 39], these proteins may be involved in cancer 
tumorigenesis and particular isoforms may be useful as 
therapeutic targets in human CRC. 

In summary, we propose a gene expression data ware- 
house in human CRC that is intended to help researchers 
active in the field to get an overview of the data available. 
However, reproducibility of results obtained in different 
studies was disappointing. 

As a matter of fact, the development of some types of 
unified processes for generating information, formatting, 
storing, retrieving, and querying data, regardless of the tech- 
nology used to generate it, is of central importance for 
building gene expression databases. Such unified processes 
have been proposed (see above) but have found only limited 
recognition so far. This is unfortunate because these metho- 
dological issues must be solved before trying to integrate 
experimental data from different sources into functional 
proteomics studies at the bench or in silico. The present 
meta-analysis of gene expression highlights the need for 
including a sufficient number of patients, analyzing only 
purified epithelial cells, using clinical standards (such as the 
international classification of disease, histopathology, and 
staging), assessing the variability of the technology applied, 
considering common standards for microarray data annota- 
tion and exchange, and finally for developing software 
implementing these standards and promoting the sharing of 
these high quality, well-annotated data within the life sci- 
ences community. 

In the absence of such unified processes, reproducibility of 
results within a laboratory will continue to be low, confirma- 
tion of results between different groups will stay difficult, and 
identification of diagnostic and therapeutic targets will remain 
a lottery. These problems are challenging the whole drug dis- 
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covery process from the very beginning, namely target identi- 
fication, so that target validation, assay development, and 
prioritization of compounds remains a high risk endeavor. 

We arc very grateful to G. Nestlerfor her excellent technical 
assistance. 
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Gene Expression Analysis 
Using Microarrays 

Sophie E. Wildsmith and Fiona J. Spence 



131 Introduction 

Microarrays are of increasing interest to both industry and academia as tools 
for 'gene hunting' and also as quantitative methods for routine analysis of large 
numbers of genes. Techniques such as real-time polymerase chain reaction (RT- 
PCR) TaqMan™ and SybrMan™ are generally considered to be more accurate, 
robust, larger in dynamic range and less capital intensive, but for rapid, laige- 
scale gene expression analysis using limited mRNA, microarrays and gene chips 
are preferred. 



13.2 Microarray experiments 
Platforms 

Global gene expression platforms are now available in multiple formats, 
including cDNA arrays, oligonucleotides spotted onto slides or in situ 
synthesised oligonucleotide arrays manufactured using photolithography. 
Commercial sources for these include Stratagene (La Jolla, CA), Memorec 
(K5ln, Germany) and BD Biosciences (Oxford, UK) for cDNA microarrays, 
Mergen Ltd (San Leandro, CA) for spotted oligomers and Affymetrix (Palo 
Alto, CA) for oligoarrays synthesised in situ. Purchasing from a supplier is more 
expensive than generating microarrays in-house, although the latter is beneficial 
in labour-intensive institutions or when proprietary gene information is utilised. 
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'Off-the-shelf microarrays may also have the advantage of rigorous quality 
control and standardised protocols. 

It is possible to produce oligonucleotide spotted arrays in-house, by design- 
ing oligonucleotide sequences that match genes of interest and then purchasing 
purified oligonucleotides to spot down on glass or other substrates. Alternatively 
there are new systems such as that available from CombiMatrix (Mukilteo, 
WA, USA) for computer-aided design and in situ synthesis of oligonucleotides. 
However, production of cDNA microarrays is currently the most affordable and 
popular method and is now well established. Numerous sources of informa- 
tion on cDNA microarray fabrication are available in the literature and on the 
internet (Bowtell, 1999; Cheung et ai 9 1999; Wildsmith and Elcock, 2001 and 
http://cmgm.stanford.edu/pbrown). Thus, this chapter will focus on the 
implementation of experiments and analysis of data ftojn cDNA microarrays. 
The experimental procedure differs slightly according to the number of fluo- 
rophores (or channels) and the type and manufacturer of the array. We have 
attempted to describe a generic process, indicating where possible the different 
options. Figure 13.1 demonstrates the procedure for a two-colour hybridisation. 



Sample 1 Sample 2 




Figure 13.1 The microarray experimental process for two-colour hybridisations 



RNA Extraction 
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First, RNA is extracted from the tissue or cells of interest. The quality of the 
RNA extracted is paramount to the overall success of the microarray experi- 
ment, as impurities in the sample can effect both the probe labelling efficiency 
and also stability of the fluorescent label (Hegde et al., 2000). Snap-freezing 
of tissue in liquid nitrogen, immediately after harvesting, is used to preserve 
RNA integrity. Any further sectioning of the tissues should be carried out under 
RNAse-free conditions (Fernandez et al., 1997). Total RNA can be extracted 
using kits such as TRIzol® (Invitrogen, Paisley, Scotland) and Rneasy (Qia- 
gen, GmbH, Hilden, Germany). Some researchers perform a further extraction 
of mRNA; this results in a purer starting material but has the disadvantage of 
lower yields. Affymetrix recommend between 5 and 40 u.g of total RNA is 
required for their GeneChips™ and 10 u.g'or less is the required amount of 
starting material for cDNA microarrays (Hegde etal., 2000). 



Sample .Labelling 

The mRNA is transcribed in vino, with the concomitant inclusion of labelled 
nucleotides. The labels may be fluorescent or radioactive. In the case of dual 
channel/colour hybridisations, two samples will be labelled with dyes that 
fluoresce at different wavelengths, with different emission spectra. Example 
fluorophores, available coupled to nucleotides, are Cy3, Cy5, fluorescein 
and lissamine. Wildsmith etal. (2001) have demonstrated that AlexaFluor 
546dUTP™ (Molecular Probes, Leiden, The Netherlands) gives a significantly 
higher signal than Cy3dCTP (Amersham Biosciences, Piscataway, NJ, USA). 
When performing two-colour hybridisations the control sample and 'test* sample 
are labelled with different fluorophores and the subsequent cDNA is then mixed 
together and hybridised simultaneously (Nuwaysir et al., 1999). An advantage of 
simultaneously hybridising control and treated sample is that it obviates the need 
to control for differences in hybridisation conditions or between microarrays. A 
specific example of the huge impact this technique has had includes its use 
in the first published account of gene expression data of the entire genome of 
Sacchammyces cerevisiae (DeRisi et al., 1997). In two-colour hybridisations, 
one would assume that the properties of the two fluorescent dyes being used 
are equivocal. In fact, for Cy5 and Cy3 this is not the case as Cy5 has been 
reported to give higher background fluorescence and also is more sensitive to 
photobleaching than Cy3 (Van Hal etal., 2000). In addition, there is evidence 
from several independent sources that the combination of Cy3 and Cy5 dye 
labelling can affect data in certain genes. That is to say, when experiments 
are repeated and the dye combination for the two probes reversed, inconsistent 
results are obtained with certain genes (Taniguchi etal, 2001). Despite these 
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Treated : Cy 3-S32 nm Control : Cy 5-635 nm 



Figure 132 Two-colour fluorescent scan of human gene cDNA array. The probe mix con- 
sists of DNA from HcpG2 control cells and cells treated with buthionine sulfoximine for 6 h. 
A colour version of this figure appears in the colour plate section 

facts, two-colour hybridisations arc widely accepted throughout the microarray 
community and an example of an image is shown in Figure 13.2. 

Hybridisation and Processing 

After labelling, the cDNA is purified (to remove unincorporated nucleotides), 
mixed with a hybridisation buffer and then applied to a cDNA microarray slide. 
The sample and the slide are heated prior to hybridisation in order to separate 
double-stranded DNA. A coverslip is applied (Shalon et al. t 1996), or prefer- 
ably a hybridisation chamber is used to avoid evaporation and enable an even 
hybridisation. The hybridisation and subsequent wash steps are carried out at a 
buffer stringency and temperature that enables hybridisation of complementary 
strands of DNA but reduces non-specific binding. 

Image Capture and Image Analysis 

After hybridisation the microarray slides are scanned, using either a laser or a 
phosphorimager (depending on the type of label used). There are many different 
suppliers and models of fluorescence scanner, for example the ScanArray 5000 
(Perkin Elmer Life Sciences, Zaventem, Belgium), GenePix 4000B (Axon GRI, 
Essex, UK) and the GeneArray® (Affymetrix, Santa Clara, CA). The choice of 
scanner is determined by sensitivity, resolution, flexible wavelength, file size 
generated, throughput and technical support available. 

Images are analysed using software that measures the intensity of the sig- 
nal from the hybridised spotted genes (spots), which provides a measurement 
of the amount of cDNA bound. Thus the initial concentration of messenger 
RNA is inferred. Early software packages 'drew' grids around the spots and 
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integrated across the whole area of the grid. This overcame problems associ- 
ated with accurate location of the spots, which is problematic, especially if the 
spots on the printed arrays are poorly aligned. More recent versions of software 
'draw' circles around the spots themselves and perform measurements within 
and outside of this boundary. For example, the background may be calculated 
from a region outside the spot boundary. The intensity of the signal from the 
spot may be calculated using median, mode or mean values of the pixels within 
the spot. Researchers differ in their preferences regarding using median or mean 
values (Hegde et al. f 2000) and this is likely to depend upon the protocols and 
software used. 

Image analysis software commonly is supplied with scanners or can be 
bought from the same supplier. This has the advantage of being opti- 
mised for that specific type of microarray and the benefit of upgrades 
and technical support Software for microarray analysis is available 
from BioDiscovery (http://www.biodiscovery.com), Imaging Research 
(http://imagingreaearch.com), GenePix Pro (Amersham Biosciences, Pis- 
cataway, NJ), arraySCOUT 2.0 (http://www.lionbioscience.com), NIH 
(http://www.nhgri .nih.gov/DIR/LCO/lSK/HTML/imgjmalysis .html), 
Stanford University (http: //rana . Stanford .EDU/sof tware) Media Cyber- 
netics (Silver Spring, MD, USA) and TIGR (http//www . tigr . org/sof tlab). 
Important criteria for image analysts software include speed, ease of use, 
automation and the ability to distinguish artefact from real signal (Wildsmith 
and Elcock, 2001). 

As the technology has evolved and more experience gained, it has become 
more and more apparent that the most significant issues facing microarray users 
are the processing of the vast quantities of data generated and deciding exactly 
what tools are the most appropriate for data analysis. Because of the enormity 
of this, we have dedicated a complete section to describing the current status of 
this area. 



13.3 Data analysis 

It is important to be cognisant of the fact that the practical laboratory aspects of 
using microarrays are only part of gene expression analysis. Many researchers 
generate vast volumes of data, without a clear understanding of how to manage 
and interpret them. Furthermore, the variability in microarray data confers addi- 
tional problems for analysis. In some cases the purpose of the experiment will 
be a gene-hunting exercise, in which case a cursory indication of potential gene 
biomarkers is sufficient analysis. In other instances, such as pathway mapping 
and screening studies, it is paramount that results are statistically meaningful 
and valid. The next few sections detail some relatively simple analysis meth- 
ods and recommendations for the benefit of researchers with minimal statistical 
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Figure 133 The ideal microarray experimental design and process 

training. There are also suggestions for more advanced analysis for those who 
have the assistance of a statistician or specialist data analyst. 

Most of the steps before, during and after performing a microarray experi- 
ment are optimally conducted with regard for statistics and data analysis. Careful 
planning before implementation facilitates the downstream analysis and inter- 
pretation of data. The following model summarises the entire microarray process 
with integration of the biological and data analysis components (Figure 13.3). 

Hypothesis Generation 

Any study is conceived for the purpose of investigating or obtaining supporting 
evidence for a biological hypothesis. Giving time at this early stage to consider 
downstream implications will pay dividends later. It is helpful if, rather than 
simply stating the aims of the experiment, the researcher asks the question 'What 
results do I expect?* or 'what answer will validate/invalidate my hypothesis? 1 . 
This 'reverse-engineering' proves useful in focusing the project, assessing the 
feasibility of the woik, providing early preparation for data management and 
analysis and, importantly, in managing expectations with regard to outcomes. 

A good example of careful experimental planning is demonstrated by Golub 
etal. (1999) in the classification of acute leukaemias in order to distinguish 
between acute lymphoblastic leukaemia (ALL) and acute myeloid leukaemia 
(AML). Distinguishing between ALL and AML using conventional techniques 
is known to be a difficult task. The researchers maximised their probability of 
success by choosing an easier, more defined model (normal kidney vs. renal 
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cell carcinoma), on which Co validate their analytical methods. In doing so they 
established that their techniques were suitable for classifying tissues according 
to disease and gained confidence in their approach before using the samples of 
real interest. 

Optimisation Experiments 

Although microarrays are becoming increasingly accessible to all, using these 
tools requires experience and it is unlikely that successful experiments will be 
conducted immediately. It is usual that some time is given to optimising a sys- 
tem for any specific application, for example for a given tissue or cell type. 
Additionally, the requirements for a given system may warrant some modifica- 
tions. The standard approach for a scientist to take is to vary one parameter, 
whilst keeping all others constant. This is time-consuming and does not take into 
account the interactions between different factors. Well-designed, multifactorial 
experiments (Box et al. f 1978), provide a faster route for optimisation, with a 
statistical measure of confidence. An example of this technique is in the opti- 
misation of microarray experimental conditions for preparation of fluorescent 
probes from rat liver dssue (Wildsmith etal., 2001). When a major source of 
variation is revealed this can be investigated further with a view to minimising 
it or providing sufficient replicates to account for it 

Design of Experiments 

Once confidence in the experimental procedure has been obtained the researcher 
is likely to have gained an insight into the reproducibility of the system. This 
assists in the design of the experiments, in particular in determining the mini- 
mal number of replicates necessary. Replication can be implemented at many 
stages - from biological samples through to microarray slides. 

Owing to the enzyme-catalysed transcription reactions, a large amount of 
variation occurs during the probe-making stages in microarray experiments. Our 
work indicated that replicates should be made at this step and a minimum of six 
replicate probes are made for microarray experiments (Wildsmith et aL f 2001). 
These can be pooled or hybridised separately onto six microarray slides. 

Lee et aL (2000) have examined the effect of the different location of cDNA 
spots on the glass slides and concluded that replicates are essential to provide 
meaningful data and to enable reliable inferences to be drawn. 

With regard to commercially available gene chip systems, such as that avail- 
able from Affymetrix (see Figure 13.4), the variation between chips, within a 
batch, is likely to be low due to stringent quality control and highly automated 
manufacture. The use of an automated wash station also reduces variability in 
intensities between chips. However, using a multi-step approach in the probe 
preparation and subsequent antibody binding steps may lead to variation between 
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Figure 134 The GeneChipQ Instrument System. From left to right, the hybridisation sta- 
tion, scanner and workstation. Image courtesy of Asymetrix. A colour version of this figure 
appears in the colour plate section 



replicate, samples prepared on different days. Pooling of reagents within an 
experiment, and analysing controls together with treated samples, will both 
reduce the variability within a given experiment. 

Conduct of Experiment 

At this stage some attention may be required for verifying and validating pro- 
cesses. For example, checking that the imaging instruments give consistent 
results across the slide, on repeat use and from day to day. If two imagers 
arc used it is important to verify that the results from both machines are com- 
parable. Some laboratories read fluorescence of one channel and then adjust the 
laser intensity of the second channel in order to obtain comparable readings. 
This is a method of normalising for the difference in intensities of fluorophores. 
It is important to be aware that this approach has a number of drawbacks. The 
arbitrary value of the second laser intensity setting will vary from experiment 
to experiment; thus comparisons of this channel cannot be made across experi- 
ments. Also the response of the fluorophore may not be linear across the laser 
intensity settings and this can lead to additional errors. 

Another area for investigation prior to running the study itself is the image 
analysis component. Depending on the software used, the image analysis pack- 
age may process the data to some extent, for example automatic background 
subtraction. Full understanding of the software is required so that it is clear at 
what point the data are 'raw', and the extent of inherent, inseparable manip- 
ulation. Effort may be required to determine the optimum settings for any 
software parameters. 

As data are generated it is important to be aware of the data integrity - for 
example ensuring that all data are collected, so that there are no missing data that 
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Figure 13.5 The relative fluorescence units (RFUs) of 1248 genes on seven microar- 
ray slides that were. hybridised with cDNA matte from. the liver of. a .rat treated with 
acetaminophen. Note the gene outliers at approximately gene number 480 

can complicate analysis later. The researcher may be intuitively aware of any 
spurious results and should be alert for anything extraordinary that could indi- 
cate problems, for example hybridisation intensities appearing inconsistent from 
sample to sample. Data analysis at this point can be a rapid indicator of dubious 
results. For example, Figure 13,5 shows a plot of the fluorescent intensities of 
1248 genes that were hybridised with probe derived from acetaminophen-treated 
rat liver tissue. The data appear consistent, with the exception of peaks in inten- 
sity on one slide at around gene 4800. Further investigation of the microarray 
revealed a large artefact that had been missed by die image analysis process 
(Figure 13.6). 



Raw Data Generation and Storage 

One issue that arises when carrying out microarray analyses is how much data 
to store and in what form. For Good Laboratory Practice (GLP) purposes, often 
required in industry, storage of the raw data is necessary. This could be construed 
as the microarray image. Storing the image analysis results requires far less 
storage space and is easier to visualise, but it has the drawback that image 
analysis cannot be redone should superior software be available in the future. 
In reality, the methods used for microarrays are continually changing and the 
likelihood of revisiting old images on which the analysis has been performed, 
using outdated protocols is quite small. 
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Figure 13.6 Portion of scanned image showing region where artefact occurred that caused 
very high signals, which were classed as outliers 

Pre-processing 

A number of pre-processing steps are often used in microarray analysis. These 
include filtering, log transformation, normalisation and background subtraction. 
Filtering may be used before or after transformation in order to extract data 
from preferred regions of interest, or in order to remove outliers (see the 
above example relating to image analysis artefact). One example of filtering 
is the removal of individual gene replicates that lie outside a given number (for 
example 5) of standard deviations from the mean. Alternatively, data points that 
lie in the top/bottom few percentiles (e.g. 0. 1 %) of the data can be removed. This 
method of removing outliers is also called 'trimming'. It is acceptable if there 
is a large volume of data where only a small proportion of data is removed and 
if the same method is applied consistently across all data. Care must be taken 
in the way in which this is carried out in order not to delete genuine data. For 
example, if one gene is consistently high or low in expression across replicates, 
then it is unlikely to be an outlier. 

Another method of detecting outliers is to plot all genes (see the section 
'Conduct of experiment' above) or to perform PCA analysis (see the section 
•Multivariate analysis' below) to detect replicate outliers. The use of a PCA 
plot to detect outliers is shown in Figure 13.7. 

Log transformation of data is accepted universally because the fluorescence 
data that are generated from microarrays tend to be skewed towards lower val- 
ues. There are scientifically valid reasons why ratios of raw expression values 
should not be used (Nadon and Shoemaker, 2002). When using two-colour 
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Figure 13/7 PCA plot of data used in Figure 13.4 showing one microarray (number 2) as 
an outlier 



hybridisations it is common to express the ratio of treated to control as a loga- 
rithm in base 2 (Quackenbush, 2001). Thus genes up-regulated by a factor of 2 
have a log 2 (ratio) of 1, and genes down-regulated by a factor of 2 have a lo*> 
(ratio) of -1. ^ 6 

Normalisation and background subtraction techniques are methods of data 
manipulation and their use is more subjective and often debated. The purpose of 
these techniques is to reduce the error (variability) that occurs between replicates 
and thus enable a comparison of data across samples. 

The theory behind background subtraction is that during hybridisation there 
will be non-specific binding to the slide. This will effectively 'darken' the 
image and give falsely high readings of fluorescent intensity. Correcting for 
the non-specific hybridisation should reduce error due to background staining. 
Background subtraction often occurs automatically in microarray image analy- 
sis packages. The software may circle the spot of interest and use the region 
beyond the periphery as the measurement of background. In cases of uneven 
hybridisation this method enables locally high background to be subtracted on 
a regional basis. One criticism of this approach is that the slide surface beyond 
the periphery is not similar, in chemical terms, to that where the nucleic acid 
has been deposited, and therefore cannot act as a real control for non-specific 
binding. A more accurate measurement of non-specific binding can be gained 
from using a region where spotting chemicals have been deposited, but no target 
is present. This concept is the basis of a method using local 'blank spots' (Wu 
etai, 2001). 

A number of methods exist for normalisation of data. These include normal- 
ising to total signal or to a 'known' spot or gene, standardisation, or proprietary 
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methods. Normalising to total signal is the simplest approach, whereby the gene 
intensity is expressed as a percentage or proportion of the signal intensity for the 
entire array. This method works best when the total intensities for the microar- 
rays are similar and the number of changes is small compared with the number 
of genes. However, we often find, when using arrays of around 1000 genes, that 
pathological disease can up-regulate a large number of genes simultaneously. In 
this case, when total signal normalisation is applied, highly up-regulated genes 
will appear less up-regulated and genes that do not change from the control will 
appear down-regulated. 

Normalisation to a control value is a more popular technique. A control value 
can be obtained from using a gene known to remain constant under the condi- 
tions of the experiment. DeRisi et al (1997) used a panel of 90 housekeeping 
genes for normalisation, but found considerable variation in their gene expres- 
sion. Unfortunately it is very difficult to know with certainty that a gene will 
not change and there is evidence to suggest that so-called 'housekeeping genes' 
are variable (Savonet et al. f 1997). Other control genes can be derived from an 
alternative species; these should not be expected to hybridise. We have used 
yeast and Arabidopsis genes as negative controls for hybridisations of rat tis- 
sues. No orthologs were known to the genes selected; however, in . most . cases 
non-specific binding occurred. 

If two or more microarray replicates appear to be different, but they are 
expected to be the same, then they can b$ standardised. An example of this 
might occur if the total intensity of one microarray is greater than another, but 
the genes are proportionally equally up- or down-regulated. If the microarray 
sample spot data are assumed to be drawn from a normal distribution, then the *z- 
transform' can be used. This requires that the mean and the standard deviation of 
the intensity values for each microarray are determined. The mean is subtracted 
from each individual gene value and the remainder is divided by the standard 
deviation. The intensity values from each microarray will then have the same 
mean and standard deviation. This has the advantage of facilitating comparison 
of microarrays with different dynamic ranges as well as total intensities. If the 
data are not normally distributed, then alternative non-parametric methods can 
be used, such as normalising to the median. 

Univariate Analysis 

Univariate methods of analysis involve examining one variable, or gene, at a 
time. This can be a very laborious task when examining a large volume of 
data, yet it is the preferred method of biologists. The simplest technique is to 
compare control and treated values and express the result as a 'fold-change* 
ratio. Typically, when examining small volumes of data, fold changes greater 
than 2 and less than 0.5 are considered meaningful (Quackenbush, 2001). This 
cut-off is essentially arbitrary and has the distinct drawback that microarray data 
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are not homoscedastic; that is, there is more variation about the mean at low 
values than there is at high values (Draghici, 2002). 

A second method for finding up- or down-regulated genes uses the standard 
deviation (SD) of the replicate gene data. Thus if changes greater than, say, 2 
SD from the log mean ratio are considerably greater than changes associated 
with 'noise', then they are considered significant This technique means that 
when looking at a large number of genes that are normally distributed there will 
be up- and down-regulated genes, regardless of whether there are (biological) 
changes (Draghici, 2002). 

Rather than using arbitrary cut-off values it is far more meaningful to express 
the fold-change in terms of either confidence intervals, or a *p- value' (that 
is, the probability of the value occurring by chance). Thus, a fold-change of 
1.1 may be associated with a p-value of 0.001 and thus the probability that 
the gene is not up-regulated is 1 in 1000. Naturally, such small fold-changes 
may then be queried in terms of biological significance. One must then ask 
the question: Are large fold-changes more important (biologically) than small 
ones? Simple calculation of p-values for two data sets can be obtained using 
f-test functions in standard spreadsheet software. A number of replicates are 
necessary" for this approach, and the data* must be normally distributed. We 
have recently developed a method for calculating p-values for fold-changes that 
is not influenced by the distribution of the data or outliers and applied it to 
TbqMan™ and microarray data. Other complex and computationally intensive 
methods for calculating p-values are described in Draghici (2002) and Nadon 
and Shoemaker (2002). 

Multivariate Analysis 

Multivariate analysis of gene expression data is becoming increasingly popu- 
lar in the microarray community and in other biological domains where large 
volumes of data are generated. Multivariate analysis methods include princi- 
pal component analysis (PCA), factor analysis, multivariate analysis of variance 
(MANOVA) and cluster analysis. Currently, cluster analysis is the most widely- 
used method in the microarray community but PCA is growing in popularity 
(Crescenzi and Giuliani, 2001; Konu el al. 9 2001). 

Quackenbush (2001) provides a good review of clustering tools that is rather 
unique in the regard that different clustering algorithms and linkage methods 
are presented. Clustering methods are unsupervised, and they are powerful tools 
for gaining insight into huge data sets. They enable the data to be partitioned 
in order to facilitate interpretation; however, they do suffer from subjectiv- 
ity. This is because the user selects various parameters, such as the algorithm 
used, linkage type, distance metric and, sometimes, cluster size. Whatever the 
data, clusters will always be identified, thus there is also a tendency to over- 
interpret the data - trying to attach meaning to clusters that may have no bio- 
logical relevance. 
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Software available for cluster analysis includes Ouster, the output of which 
is viewed in Treeview; both available from http://rana.stanford.edu/ 
software. This tool is particularly useful for clustering genes to identify genes 
that are co-regulated. 

The PCA is a visualisation tool that enables complex, high-dimensional data 
to be represented in two or three dimensions. It facilitates identification of groups 
of similar data, thus enabling inferences to be made about the samples. 

An example is shown in Figure 13.8. The figure shows the gene data for one 
sample (control rat liver) that was hybridised to seven microarrays according 
to the method used in Wildsmith et al. (2001). Each microarray contained two 
replicate gene sets; thus there were 14 replicate gene sets in total. The gene sets 
comprised 1248 genes (with controls). All the data (14 x 1248 data points) was 
input into the analysis and the PCA plot displays the 14 replicates individually. 
The two axes are principal components 1 and 2. Principal component 1 (PCI) 
accounts for 65.5% of the variation in the date, whereas PC2 represents only 
13%. This means that the model accounts for 78.5% of variation in the data. 

The first principal component (PCI) accounts for as much as possible of 
the variation in the.original data and subsequent components (e.g. PC2) are of 
decreasing importance. Thus, samples 8 and 9 are-very different from samples 1 
and 2. In terms of interpreting the PCA plot, it is immediately clear that there are 
three or four distinct clusters of data. These are marked by circles. Datapoints 
tend to cluster in pairs; for example replicates 1 and 2, 3 and 4, 5 and 6, 
etc. These are the duplicate gene sets on the same microarray. This indicates 
that the variation within the microarray is lower than the variation between 
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Figure 13.8 PCA plot of the microaiTay results from seven slides (2690, 2692, 2723, 2869, 
2876, 2879, 2880), each with two replicate spot sets (labelled 1-14), after hybridisation with 
control rat liver. See text for explanation 
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replicate microarrays. However, given that the same sample is applied to all 
the microarrays, we must ask why we get further separation of the replicates. 
The answer lies in the associated data. The four-digit numbers associated with 
the clusters are the microarray slide numbers that indicate when they were 
printed. Numbers that are more similar seem to be more closely related, and thus 
we can hypothesise that there were some differences between the slides, such 
as differences in slide backgrounds or changes during the printing process or 
change-over of batches, between slides in the 2600-2700 region and the 2800s. 

The PCA provides a clearer overview of the data than does cluster analy- 
sis. It is a rapid method for gaining an insight into the results, in particular 
where biological meaning can be attached to the components (Crescenzi et a/., 
2001). There are a number of packages for multivariate data analysis, including 
SIMCA-P (Umetrics, AB, Umea, Sweden) and The Unscrambler (Camo ASA, 
Norway), both of which are useful for PCA. 

Other tools for data visualisation include software packages such as Spot- 
fire.net (Spotfire Inc., Cambridge, MA, USA) and GeneSpring (Silicon Graphics, 
San Carlos, CA, USA). Spotfire is particularly useful for visualisation of mul- 
tidimensional data and for visualisation of temporal data. It is possible to use 
these tools to identify genes (hat arc co-ordinately expressed over time. 

Biological Interpretation 

After developing a sound experimental strategy, ensuring that the results are 
statistically valid, and after analysis of the data, it is down to the biologist to 
assemble the pieces of information that have been obtained. This intertwined 
information may include unexpected results that are contradictory to intuition or 
to published literature. One way to untangle the data is to map the relevant genes 
onto existing pathways and known functions. The Kyoto Encyclopedia of Genes 
and Genomes (KEGG), available at http://www.genome.ad.jp/kegg/, is a 
useful source of information, especially where the gene products are enzymes. It 
enables visualisation of the position of up- or down-regulated genes in metabolic 
pathways. 

The gene expression data obtained may differ from protein expression data, 
or information on gene product activity or location. When initiating a study 
it is useful to consider additional endpoints that can assist in the interpreta- 
tion of the data. For in vitro studies, these might include cytotoxicity endpoints, 
metabolites, key signalling molecules or perhaps protein expression. Waring 
et ai (2001) used tetrazolium dye reduction (MTT) as a measure of hepatocyte 
cell viability for their studies of gene expression in response to hepatotoxic 
insult. For in vivo studies, expression information on the tissue of interest could 
be supported by pathology, histology and blood chemistry measurements. Gene 
expression results could be confirmed by in situ hybridisations or protein activity 
assays. 



284 GENE EXPRESSION ANALYSIS USING MICROARRAYS 

13.4 Recent examples of microarray applications 

One area of rapid progress using microarray technology is the increased under- 
standing of cancer. Molecular pathologists are subgrouping cancers of tissues 
such as blood, skin and breast, based on differential gene expression patterns. 
For example, within a small group of breast cancer tissue samples, Perou et al. 
(2000) distinguished two broad subgroups representing those expressing or alter- 
natively lacking expression of the oestrogen receptor-a gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared 
with the previous methods of gene identification. 

Another example of the impact of this technology is in the identification of 
two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran 
etal. % 2001). 

Microarray technology has also accelerated the understanding of the molecu- 
lar events surrounding pulmonary fibrosis. Specifically, two distinct clusters of 
genes associated with inflammation and fibrosis have been identified in a dis- 
ease where, for years, the pathogenesis and treatment have remained unknown 
(Katsuma et a/., 2001). 

13.5 Conclusions 

Important factors in gene expression experiments include sensitivity, precision 
and reproducibility in the measurement of specific mRNA sequences (Schmittgen 
et a/., 2000). These quality metrics can be maximised by using, or fabricat- 
ing, high-quality microarrays, and by optimising each step of the microarray 
process. From conception to conclusion it is important to bear in mind the 
original hypothesis. 

Having considered the complexity of the microarray experiment, the value 
obtained from a meticulously designed experiment should not be underesti- 
mated. As the number of high-quality gene expression studies increases, we hope 
that the literature will contain increasingly detailed information that will help 
interpret complex gene expression changes, and thus elucidate the mechanisms 
of disease. 
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13.9 Useful websites 

Protocols: 

http : //cmgm . Stanford . edu/pbrown 

http : //cmgm . Stanford, edu/pbrown/protocols . html 

Image analysis software: 

http : / /www . genome . ad . jp/kegg/ 
http : / /r sb . info . nih . gov/nih - image 
http : //rana . Stanford . edu/sof tware 

http : / /www . nhgri . nih . gov/DIR/LCG/lSK/HTML/img-analysis . html 
http : //rana . Stanford . EDU/sof tware 
http//www. tigr.org/softlab 
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INCREASING NUMBERS OF HUMAN Dis- 
eases, both acquired and genetic, are 
being considered to be based at least 
in part on alterations in DNA se- 
quence. For most diseases, inheritance 
and acquisition are likely to be com- 
plex and polygenic. The efforts of the 
Human Genome Project to elucidate the 
structural genetic background by iden- 
tifying the chromosomal positions and 
genomic organization of between ap- 
proximately 30000 and 35000 human 
genes are nearly complete. 1 Based on this 
structural knowledge, a byproduct 
should be a better "scaffolding" to help 
link specific genes to susceptibility to 
various human diseases. However, to un- 
derstand how the products of these ge- 
netic linkages work together to orches- 
trate the initiation and progression of 
particular complex diseases, there will 
be a need to apply a functional genetic 
rather than a structural genetic ap- 
proach. u 

Until recently, functional genetic 
studies have generally been of limited 
scope, only able to elucidate the role of 
1 or a few genes at a time in 1 system. 
Information on the specificity and rela- 
tive abundance of expression prod- 
ucts has traditionally been obtained by 
techniques such as RNA Northern blot 
hybridization and ribonuclease protec- 
tion assays. Somewhat more sophisti- 
cated methods, such as differential dis- 
play' and Serial Analysis of Gene 



DNA microarrays represent a technological Intersection between biology and 
computers that enables gene expression analysis In human tissues on a ge- 
nome-wide scale. This application can be expected to prove extremely valu- 
able for the study of the genetic basis of complex diseases. Despite the enor- 
mous promise of this revolutionary technology, there are several Issues and 
possible pitfalls that may undermine the authority of the mlcroarray plat- 
form. We discuss some of the conceptual, practical, statistical, and logisti- 
cal Issues surrounding the use of microarrays for gene expression profiling. 
These Issues Include the Imprecise definition of normal In expression com- 
parisons; the cellular and subcellular heterogeneity of the tissues being stud- 
led; the difficulty In establishing the statistically valid comparability of ar- 
rays; the logistical logjam In analysis, presentation, and archiving of the vast 
quantities of data generated; and the need for conflrmatlonal studies that 
address the functional relevance of findings. Although several complicated 
issues must be resolved, the potential payoff remains large. 
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Expression, 1 have been used to screen 
larger numbers of complementary DNA 
(cDNA) clones. However, technical 
limitations render these techniques 
nonconducive to large-scale genetic 
survey. 

To this end, a powerful new tech- 
nology is emerging, using hybridiza- 
tion to nucleotide arrays, the so-called 
gene chips. 6,7 This technological inter- 
section of biology and computers en- 
ables the reliable screening of a vast 
number of genes simultaneously and is 
amenable to automation. On a nylon 
membrane or glass surface, gene- 
specific cDNAs can be spotted, or oli- 
gonucleotides can be synthesized in situ 
by a combination of photolithography 
and oligonucleotide chemistry. This 
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permits simultaneous monitoring of the 
expression of thousands of genes in a 
single step. Individual chips can be cus- 
tomized to include any chosen set of 
fully or partially characterized ge- 
nomic or expressed sequences. Chips 
can monitor over 50000 unique se- 
quences. The power of these chips lies 
in the potential for comparative expres- 
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Deflnttton 
of Normal 



sion studies in diseased vs normal 
samples, and In documenting changes 
at different stages during the natural 
course of the disease or in response to 
treatment It provides the researcher 
with a new arsenal to analyze under- 
lying pathomechanisms on a grand scale 
and also to review the rationale of thera- 
peutic concepts. 

However, despite the enormous po- 
tential of this revolutionary technol- 
ogy* there are several issues and pos- 
sible pitfalls that attenuate the power 
of microarrays. First, the definition of 
normal in expression comparisons is 
neither precise nor unambiguous. Sec- 
ond, the heterogeneity of the tissues 
being studied complicates the mean- 
ing of the expression profiles. Third, the 
statistically valid comparability of ar- 
rays is an unresolved problem. Fourth, 
the vast quantities of data create a lo- 
gistical logjam for analysis, presenta- 
tion, and archiving. Finally, conilrma- 
tional studies are needed to corroborate 
the biological significance of microar- 
ray data (Figure 1). 

TROUBLE WITH NORMAL 

The standard normal vs diseased tis- 
sue type of comparison, which is the ba- 
sic design foundation of profiling stud- 
ies, may be more quicksand than 
bedrock. Normal is not so easy to de- 
fine — neither is diseased. Gene expres- 
sion in normal tissue is likely to be de- 
pendent on several factors involving 
patient and sample variation. These fac- 
tors will also have an impact on expres- 
sion profiles of diseased tissue. 

Patient Variation: Ethnicity, 
Sex, Age, Genetic Background, 
Disease States 

The ethnicity, sex, age, and genetic back- 
ground of a patient are likely to affect 
the gene expression profiles of many tis- 
sues to varying extents."-* A simple ex- 
ample is provided by the expression pro- 
files of genes involved in scalp and body 
hair follicle activity, which can be ex- 
pected to vary over a normal range un- 
der the influence of all of these sources 
of patient variation. The effects of these 
parameters on gene expression are likely 
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There are (nuec that complicate each step of mtcroarray gene expression anafysls. 



to be subtle but pervasive, not fully un- 
derstood at this time, and quite prob- 
lematic for defining normal. 

The presence of disease in a subject 
who is the source of tissue for control 
purposes, presents further potential 
variabilities. For example, there maybe 
a significant difference in the conclu- 
sions reached by 2 similar mtcroarray 
expression profiling studies. One may 
compare genes expressed in a pa- 
tient's diseased lung tissue with those 
expressed in normal, nondiseased lung 
tissue from the same patient, and an- 
other may compare genes expressed in 
the same patient's diseased lung tissue 
with those expressed in normal, non- 
diseased lung tissue from a healthy con- 
trol or normal individual. Moreover, it 
is also possible that seemingly unre- 
lated disease states may influence gene 
expression at distant sites. For in- 
stance, the presence of diabetes in 1 of 
2 renal cancer patients may compli- 
cate the direct comparison of renal 
tissues. 

Sample Variation: Proximity 
to Disease, Anatomic Location, 
and Developmental Range 

Yet another complication derives from 
the proximity of the normal tissue used 
as a control for the diseased tissue. Tis- 
sue adjacent to an area of disease may 
not be normal despite absence of evi- 
dence of disease clinically or under the 
light microscope. Normal-appearing tis- 
sue near a tumor could, for example, 
be gcnotypically altered or exhibit an 
altered gene expression profile. 1015 



Moreover, factors such as the degree of 
disease-associated inflammation may 
have a significant impact on gene ex- 
pression profiles. Other bystander ef- 
fects, epiphenomena, or secondary dis- 
ease processes could all play important 
roles in determining expression pro- 
files within these adjacent, so-called 
normal tissues. These factors must be 
considered in the choice of normal. 

The precise location within a par- 
ticular organ may be another impor- 
tant factor that affects gene expres- 
sion. 9 Por example, Just as location 
relative to the urethra may influence ex- 
pression profiles in the prostate, 14 skin 
from the nose, back, and palm are cer- 
tain to have different expression pro- 
files as well, despite all being from the 
same organ. Thus, site and specific ana- 
tomic location must also be taken into 
account in a description of normal. 

It must also be kept in mind that the 
definition of normal actually repre- 
sents a dynamic state." All tissues, 
which are composed of early and late- 
stage cells, have a normal developmen- 
tal range. For example, normal epithe- 
lium in prostatic ducts ranges from 
atrophic to resting to hyperplastic, and 
each has a unique pattern of gene ex- 
pression. 14 

A 3-dimensional analytic approach is 
a strategy that has been used to address 
some of these concerns about defining 
normal. Cole et al H used a 3-dimen- 
sional model to characterue the entire 
prostate gland in their study of gene ex- 
pression profiles In prostate cancer. In 
this study, whole-mount prostactec- 
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tomy specimens were dtvldcd Into trans- 
verse cross sections such that the entire 
prostate gland, including the complete 
spectrum of normal epithelium and tu- 
mor progression, was available for view- 
ing, microdissection, and microarray 
analysis. This method was used to de- 
termine the exact physical relationship 
of the normal ducts, premalignant le- 
sions, and tumors— thus obtaining an 
anatomic framework on which to over- 
lay gene expression data. This tech- 
nique offers several advantages over the 
normal vs tumor comparison. Previous 
studies had used normal epithelium in 
prostatic ducts as a baseline control 
against which to compare and contrast 
tumor gene expression profiles. 15 '" How- 
ever, the expression profile of this nor- 
mal epithelium is affected by proximity 
to tumor, location within the gland, and 
developmental state. 11 These factors can 
be better appreciated using a 3-dimen- 
sional approach. 

Disease-Related Variation 

Of course, many of the parameters that 
affect normal expression profiles (pa- 
tient ethnicity, age, sex, and genetic 
background, location within an or- 
gan, and developmental stage) will also 
affect disease expression profiles. lT " 
Disease heterogeneity, including sub- 
type, activity, severity, stage, and pre- 
vious as well as current treatments, also 
may have a significant impact on gene 
expression. 1 *" 

Categorizing and subgrouping pa- 
tients on entry into a study may be use- 
ful to control for as many of these fac- 
tors as possible. However, there may be 
problems surrounding attempts to de- 
fine microarray-based categorization on 
the basis of another imperfect categori- 
zation system, such as histology, as these 
groups are sometimes arbitrary or in- 
consistently designated. Nevertheless, de- 
termining whether gene expression pro- 
files correlate with existing clinical or 
histological categories can provide new 
insights into the meaning of these cat- 
egories as can new methods of classify- 
ing cancers or other diseases into spe- 
cific diagnostic categories based on their 
gene expression signatures. Several stud- 
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ies have been able to establish expression- 
based criteria (class predictors) for pre- 
existing categories and then use these 
new criteria to categorize new cases (class 
prediction). 1 *" 29 Global profiling may also 
allow the development of new classifi- 
cation systems based on gene expres- 
sion alone (class discovery). 2 * 10 

Thus, when possible, it will be of 
value to profile a range of normal and 
diseased cell populations from a num- 
ber of patients to distinguish between 
differences in expression that are rel- 
evant to the disease process and those 
reflecting the biological spectrum of the 
normal tissue or that have occurred for 
reasons unrelated to the disease. The 
significance of this distinction, is fur- 
ther appreciated when taking into ac- 
count the vast quantity of data gener- 
ated from microarrays and the potential 
for confounding interpretation from the 
inclusion of differential expression un- 
related to disease processes. 

It is worth noting, however, that the 
issues of patient and sample variability 
are not unique to microarray experi- 
ments. In fact, microarray experiments, 
in contrast to classic single-gene experi- 
ments, may actually provide the tools for 
identifying this heterogeneity. For ex- 
ample, DNA microarrays have been used 
to explore physiological variation in gene 
expression on a genomic scale in 60 cell 
lines derived from diverse tumor tis- 
sues. 31 Cluster analysis allows the iden- 
tification of prominent features in gene 
expression patterns that appear to re- 
flect molecular signatures of the tissue 
from which the cells originated. 31 

HETEROGENEOUS CELL 
POPULATIONS 

A further complication encountered with 
expression profiles is that any given tis- 
sue is composed of several cell types, 
members of which are likely to be within 
a spectrum of dynamic functional states. 
For example, a simple punch biopsy of 
the skin may contain keratinocytes, me- 
lanocytes, Langerhans cells, Merkel cells, 
adipocytes, smooth muscle cells of ar- 
rector pili, striated muscle cells of the 
panniculus camosus, blood cells includ- 
ing immune system cells, and cellular el- 

18 (Reprinted) 



ements of blood vessels, nerves, hair fol- 
licles, sebaceous glands, and sweat 
glands. Moreover, cells bum each of these 
populations will be at various stages of 
development and levels of activation, per- 
forming different functions and respond- 
ing to disease processes or treatments in 
different ways and to varying extents. The 
result is a highly heterogeneous sam- 
pling of cells, each expressing a unique 
set of genes. An expression profile gen- 
erated from a microarray study of the 
RNA in such a sample will thus repre- 
sent merely a snapshot of the genes ex- 
pressed by a plethora of cells at a mo- 
ment in time. Such extensive cellular 
heterogeneity complicates the ability to 
draw conclusions about specific pro- 
cesses occurring within a tissue speci- 
men. An illustrative example is pro- 
vided by Stanton et al," who used 
microarrays to identify genes differen- 
tially expressed during myocardial in- 
farction. The expression profiles they 
studied represented transcripts from cell 
populations as diverse as immune sys- 
tem cells, which migrated to the infarct 
region and are responsible for the in- 
flammatory response, cardiac myocytes 
within the ischemic area undergoing 
apoptosis and necrosis, fibroblasts un- 
dergoing proliferation and participat- 
ing in the formation of scar tissue to re- 
place the infarct, and cardiac myocytes 
undergoing hypertrophy to compen- 
sate for the loss of cells in the Infarct 
area." The issue of such cellular hetero- 
geneity was avoided by categorizing the 
differentially expressed genes into func- 
tional categories to look for patterns in- 
dicative of cardiac remodeling without 
attempting to attribute specific tran- 
scripts to specific cell types. For gene ex- 
pression studies involving samples with 
mixed cellular populations, further in- 
vestigation, such as with in situ messen- 
ger RNA (mRNA) hybridization, maybe 
necessary to localize the transcripts be- 
fore conclusions can be drawn about the 
roles of specific genes in specific cell types 
during the disease process. 

Laser Capture Microdissection 

An ingenious but technically delicate 
approach to the study of complex bio- 
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logical samples has become possible 
with the development of laser capture 
microdissection (Figure 2). m This tech- 
nique allows for the rapid and accu- 
rate procurement of cells from spe- 
cific areas of tissue under direct 
microscopic visualization, and thus 
makes the molecular genetic analysis 
of defined populations in their native 
tissue environment possible." Sgroi et 
al M demonstrated the feasibility of com- 
bining laser capture microdissection 
with high-throughput cDNA arrays. 
They showed that in vivo subpopula- 
tions of malignant cells from multiple 
stages of breast cancer progression 
could be separated from nonmalig- 
nant populations, and their expres- 
sion profiles could subsequently be ana- 
lyzed using microarrays. 

The potential is powerful Specimens 
could be separated into tissue layers; for 
example, separating a skin biopsy into 
epidermis, dermis, and hypodermis. Tis- 
sues could be further differentiated into 
specific structural components, such as 
dermis into blood vessels, adipose, ar- 
rector pili, and sebaceous glands. Struc- 
tures could be separated into defined cell 
types, such as blood vessels into endo- 
thelial cells, erythrocytes, and lympho- 
cytes. Cell types could even be sepa- 
rated into marker-defined subtypes, such 
as lymphocytes into CD4 and CDS cells. 
Expression profiles from refined and de- 
fined structures and cell types likely 
would be extremely valuable in the study 
of disease. 

Potential aside, there are significant 
limitations to this technology at the pres- 
ent time. The standard protocols for fix- 
ing and embedding tissue samples from 
surgical resections were not designed 
to be compatible with microarray ex- 
periments, with or without laser cap- 
ture microdissection. Typically, tissue 
suspected of being important Tor diag- 
nosis and staging is processed through 
aldehyde-based fixatives, such as for- 
malin, which damage mRNA integ- 
rity. 47 If frozen tissue is available, mRNA 
can be recovered and studied from dis- 
sected cell populations. However, fro- 
zen tissue sections are technically dif- 
ficult to prepare, the histology is often 



Figure 3. Laser Capture Microdissection 
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cection for further processing ^ retrieve ceffutarmatertab (eg, DNA, RNA, proteins). 



severely compromised, and the tissue 
available may contain only a limited 
portion of the lesion. 14 

Moreover, the sample amounts gen- 
erated from laser capture microdissec- 
tion can be small, even as miniscule as 
a single cclL M Consequently, the yields 
of RNA are low. Arrays have a thresh- 
old for the quantity of molecular start- 
ing material: at least 5 to 15 pg for oli- 
gonucleotide arrays and between 2 and 
100 pg for cDNA arrays, depending on 
the manufacturer, the source of the 
RNA, and the use of signal amplifica- 
tion.™ 40 Studies that have successfully 
integrated laser capture microdissec- 
tion with microarray technology have 
used samples of approximately 1 X 10* 
to 1 X 10 s cells with 95% to 98% ho- 
mogeneity as determined by micro- 
scopic visualization.*- 41 If needed, am- 
plification techniques may be used to 
generate sufficient genetic material for 
microarray hybridizations. 41 Laser cap- 
ture microdissection is an intriguing 
technology, but time will tell whether 
its potential is realized. 

Although some biological issues re- 
lated to gene expression may be com- 
plicated by the presence of heterog- 
eneous cell populations in studied 
samples, it is also true that some bio- 
logical conditions can be understood 
only in the context of these heterog- 
eneous cell populations. The nature of 
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global gene expression experiments is 
to uncover differences between 2 bio- 
logical samples, including those differ- 
ences based on diverse cell popula- 
tions. For example, to appreciate a 
disease that is characterized by an in- 
flammatory infiltrate, it must be un- 
derstood that the inflammatory infil- 
trate is part of the disease and is part 
of the difference between diseased and 
nondiseased tissue. Thus, the isola- 
tion of specific cell populations for 
study is not necessarily required or even 
desirable in all instances. 

MAKING MICROARRAYS 
COMPARABLE 

Ideally, microarray experiments should 
be comparable both within and be- 
tween laboratory or manufacturing sys- 
tems, but obtaining consistent and com- 
parable data is a critical challenge for 
microarray-based expression analysis. 
Major sources for the observed vari- 
ability of microarray data include the 
normal physiological gene expression 
variations in different samples and the 
noise introduced in the microarray as- 
say process. 41 

Physiological Gene 
Expression Variation 

Inextricably linked to the issues of pa- 
tient and sample variability and tissue 
heterogeneity discussed above, is the 
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problem of normal gene expression 
variations and how to distinguish these 
variations from significant disease- 
associated changes. Few studies have 
systematically investigated physiologi- 
cal expression changes, but data from in 
situ hybridizations suggest that normal 
variance for many tightly regulated tis- 
sue-specific genes can be within 20% to 
30%." However, there can be as much 
as 2- to 4-fold random fluctuations for 
many genes in yeast. 13 *" Affymetrix 
(Santa Clara, Calif) guidelines have sug- 
gested that for most of the "housekeep- 
ing" genes in human tissues, which are 
likely to be less tightly regulated, differ- 
ences of less than 4-fold are probably not 
biologically significant. 43 Conse- 
quently, a significant portion of micro- 
array data variability for high- or me- 
dium-abundance mRNAs may simply be 
due to normal expression variations. Sev- 
eral previous studies have used arbi- 
trary 2-fold change criteria to define sig- 
nificant expression change. 44 However, 
the 2-fold threshold has been shown to 
be statistically invalid even for dupli- 
cate experiments. 46 In a recent study that 
used cDNA microarrays to profile gene 
expression in samples of normal skin 
from breast-reduction surgeries, 71 of 
4400 genes were found to demonstrate 
variability in expression greater than 2 
SDs from the mean of each gene, 47 These 
included genes coding for transport pro- 
teins, gene transcription, cell-signaling 
proteins, and cell-surface proteins. Thus, 
physiological variation should be con- 
sidered in the analysis and interpreta- 
tion of rnicroarray data. More stringent 
criteria for defining significant expres- 
sion change may be useful. 

Noise In the A/llcroarray 
Assay Process 

For the tightly regulated (mostly low 
abundance) mRNA species, inconsis- 
tencies introduced at any stage of the 
microarray-based assay process may 
play a major role in data variability. 42 
Due to the miniaturization and the large 
number of genes involved, it is diffi- 
cult to maintain consistent processing 
conditions for each sequence across 
multiple assays, and obtaining accu- 



rate absolute signals is unlikely. 42 Noise 
may be introduced by slide heteroge- 
neities, printing irregularities (eg, 
pin-to-pin variations), and spotting 
volume fluctuations. 48 Some of the sys- 
tematic variations may be reduced by 
the inclusion of controls, but random 
fluctuation at various manufacturing 
stages cannot be completely con- 
trolled and can accumulate quickly in 
a complicated assay. 40 

In certain rnicroarray systems, 2 
samples are competitively hybridized to 
1 array using different fluors for label- 
ing. In other systems, there is only single- 
sample hybridization. A 2-color system 
might be expected to be more reliable 
since variations in spot size or amount 
of cDNA probe on the chip should not 
affect the signal ratio (both signals are de- 
rived from the same spot). However, this 
only holds true if signals are well above 
the background in both detection chan- 
nels. 42 In fact, the signal level for most 
of the tightly regulated genes will likely 
be close to the background level. 42 In ad- 
dition, background level on a slide can 
also vary significandy from spot to spot 
due to factors such as unevenness in slide 
surface properties, dust contamination, 
and incomplete washing, leading to high 
levels of signal variability for low- 
abundance mRNA species even in 
2-color systems 42 

The high levels of variability of rni- 
croarray data also mean that subtle 
changes in experimental conditions 
may significantly alter the results, mak- 
ing it difficult for separate laborato- 
ries to compare experimental data. In 
addition, the lack of standard con- 
trols, the predominant use of relative 
signals (ratios), and the adoption of in- 
compatible data formats contribute to 
poor comparability between studies. 41 

Despite the hard-wired variability in- 
troduced by chip manufacturing con- 
ditions, most of the published studies 
to date using microarray-based expres- 
sion analysis include only limited num- 
bers of replicates. 49 In fact, many stud- 
ies conduct the experiment only once. 
Considering the potential sources of as- 
say variation, the need for sufficiently 
replicated studies is underscored. 49 



Mlcfparray Data Normalisation 

Because of variability of rnicroarray data 
for single sample arrays and for fur- 
ther analysis of 2-color system arrays, 
each must be brought into the same 
scale to compare 2 or more arrays. How 
to perform this normalization of gene 
expression levels across multiple ar- 
rays, thus removing systematic varia- 
tion between the arrays and rendering 
different experiments comparable, re- 
mains an issue that is not yet fully re- 
solved* Many of the early rnicroarray 
studies in the literature simply ig- 
nored this issue, A more statistically rig- 
orous approach is needed. 

One difficulty has been that leading 
rnicroarray manufacturers have not 
published statistical error models for 
their products. Thus, users are un- 
clear how much to adjust data for varia- 
tions in spot intensity, hybridization 
patterns, and intensity measurement 
sensitivity. Software does exist to al- 
low for array-to-array comparisons by 
using a scaling factor to normalize gene 
expression patterns across arrays. How- 
ever, in general, these algorithms as- 
sume that intensity differences be- 
tween arrays are linearly related with 
a zero y-intercept. 51 This assumption al- 
lows software to trim the tails off dis- 
tributions of expression from differ- 
ent arrays at statistical cutoffs and then 
simply move the distributions along an 
axis to a common level to provide com- 
parisons. However, this linear relation- 
ship often does not hold true. 91 When 
the average expression level of 1 array 
is higher than that of a second array, a 
longer tail will be trimmed from the sec- 
ond array. Thus, a greater number of 
genes from the first array will be 
counted as being expressed because 
their expression level is above the sta- 
tistical cutoff point. In this case, the 2 
arrays cannot be considered compa- 
rable. 

Although bioinformatic software has 
recently been developed that offers 
more statistically robust normaliza- 
tion, the cost of these commercially 
available programs (combined with the 
already expensive microarrays) has 
been prohibitive for many research- 



2284 JAMA, November 14, 2001— Vol 286, No. 18 (Reprinted) 



©2001 American Medical Association. Ail rights reserved. 



Downloaded from www.jama.com at U.S. Patent and Trademark Office, on June 6, 2006 



DNA MICROARRAYS 



ers. 50 Standardization of these pro- 
cesses awaits the development of im- 
proved methods of normalization 
leading to valid statistical models widely 
available to all researchers. 

To this end, Schadt et al 31 have de- 
veloped a standard nonlinear curve 
technique for normalizing the data in 
arrays that do not demonstrate a lin- 
ear relationship between data sets. This 
model performs well when the 2 
samples being compared demonstrate 
a low number of differentially ex- 
pressed genes. However, when expres- 
sion profiles of 2 samples vary to a 
greater extent, Schadt et al 91 recom- 
mend a rank-selection method. Using 
this method, genes expressed on an ar- 
ray are ranked from highest to lowest 
level of expression. Then, for the ar- 
ray expressing a greater number of 
genes, the genes with the lowest ex- 
pression levels are removed from the list 
until the 2 Arrays Hst a comparable num- 
ber of expressed genes! "This type of 
rank-selection method has gained sup- 
port from other groups, but it too has 
limitations. 10 Removing low-expres- 
sion level data points restricts the study 
to the more extreme and easily de- 
tected entities, a technique that blunts 
the genomic-scale potential of micro- 
array technology. 

Efforts continue to improve compa- 
rability between arrays. Jones 50 re- 
cendy applied a statistical model to nor- 
malize spotted cDNA array data that 
takes into account not only the differ- 
ences in numbers of genes expressed be- 
tween arrays, but also the interarray 
variations in fluorescent dye intensity 
and mechanical error occurring in the 
printing process. Nevertheless, the is- 
sue of how to properly normalize ar- 
ray data has not been settled. Research- 
en must continue to demand statistical 
rigor in their comparisons before they 
can believe the mathematical results of 
their data. 

LOGISTICAL LOGJAM 

Microarrays deliver massive amounts of 
data on tens of thousands of genes. The 
result is an immense quantity of bio- 
logical information that must be ana- 
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lyzed, presented, and archived in a 
meaningful way. 

Data Analysis 

In human studies, the number of hy- 
bridizations that can be performed for 
any set of experimental conditions is of- 
ten restricted by the limited number of 
obtainable tissue samples and by the ex- 
pense of arrays. Restricted numbers of 
hybridizations for each experiment 
hamper the ability to assess the bio- 
logical significance of variation within 
or between given sets of conditions. 
Thus, for the assessment of thousands 
of genes in a setting of limited hybrid- 
izations, the importance of reliable and 
sophisticated algorithms for data analy- 
sis becomes amplified. 31 

A logical beginning is to examine the 
extremes, that is, genes with signifi- 
cant differential expression in indi- 
vidual samples. For example, a com- 
parison of 2 samples can be visualized 
in the form of a simple bivariate scat- 
terplot in which the expression profile 
of i sample (x-axis) is plotted against 
that of the second sample (y-axis). The 
distribution pattern generally demon- 
strates that the expression ratios clus- 
ter around the line in which x is equal 
to y (indicating comparable levels), with 
individual genes falling varying dis- 
tances from this line. Additional lines can 
be placed on the scatterplot to repre- 
sent various fold changes of expres- 
sion. Data points that fall above or 
below these lines represent genes ex- 
hibiting expression ratios greater or less 
than the specified fold change. Thus, one 
can begin by examining those genes that 
demonstrate a 10-fold or greater change 
in expression level. To expand the num- 
ber of genes under investigation, one can 
examine genes that demonstrate a 5-fold 
or greater change, or a 3-fold or greater 
change, and so forth. Many studies de- 
fine a 2-fold or greater change in ex- 
pression level to represent significant dif- 
ferential expression. The 2-fold 
threshold, however, as noted above, has 
been shown to be statistically invalid. 4 * 
Although this simple technique can be 
efficient and effective for focusing on ex- 
panding sets of differentially expressed 
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sequences, again, such an analysis does 
not take advantage of the full potential 
of genome-scale experiments to en- 
hance our comprehension of cellular bi- 
ology that would be provided by an in- 
clusive analysis of the entire repertoire 
of transcripts in a cell as it goes through 
a biological process. 12 A more holistic ap- 
proach, which allows the deciphering of 
patterns from the entire data set, is 
needed. 

Data Organization 
and Presentation 

Statistical algorithms can be applied to 
detect and extract patterns within pro- 
filing data. It is a basic assumption of 
many gene expression studies that 
knowledge of where and when a gene 
is expressed provides information about 
the function of the gene. Therefore, an 
important beginning is to organize 
genes on the basis of similarities in their 
expression profiles.'* However, even 
this basic tenet deserves critical con- 
sideration. Similarity of gene expres- 
sion profile does not mandate similar- 
ity of function or mechanistic pathway, 
and it maybe purely coincidental. Nev- 
ertheless, the idea of clustering genes 
on the basis of their expression pat- 
terns is well established and cluster 
analysis has become the most widely 
used statistical technique applied to 
large-scale gene expression data. 31 

Although various cluster methods can 
usefully organize tables of gene expres- 
sion measurements, the resulting or- 
dered butsull massive collection of num- 
bers remains difficult to assimilate. Thus, 
another important component of ge- 
nome-wide expression data explora- 
tion is the development of powerful data 
visualization methods and tools. Ap- 
proaches have been developed that pre- 
sent clustering results in simple graphi- 
cal displays such as dendrograms, which 
represent relationships among genes by 
a tree whose branch lengths reflect the 
degree of similarity in expression be- 
tween the genes. Similarity is mathemati- 
cally defined. 51 The computed trees can 
be used to order genes in the original data 
table such that genes or groups of genes 
with similar expression level patterns are 
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Figure 9. Real-Time Polymerase Chain 
Reaction (PCR) 




A, Complementary DNA synthesis from source mes- 
senger or total RNA proceeds as with traditional re- 
verse Iran scrip Use PCR. 6, A dual-labeled Auoro* 
genk probe (with a higher melting point than the PCR 
primers used for extension) h annealed to the target 
sequence between the forward and reverie PCR prim- 
ers. A reporter (R) Ruorochrome (usually 6-carboxy- 
fluorosceln) is attached to the 5' end of the probe and 
a quencher (Q) molecule (usually 6*carboxy- 
tetramethyl-rhodamine) ft attached to the 3' end. 
C, As Taq polymerase extends In the S' to 3* direc- 
tion, the dual-labeled probe begins to be displaced from 
the target sequence. D, As the Taq polymerase con- 
tinues to extend, the 5' to 3' endonudease activity 
cleaves the reporter (R) molecule from the probe se- 
quence such that Its emission spectra (91 8 run) are no 
longer quenched by the second (Q) fluorescent dye. 
Fluorescence Is measured continuously throughout the 
PCR amplification In real time and Is proportional to 
the amount of PCR product generated in each cycle. 

placed adjacently. Clustering methods 
can also be combined with representa- 
tion of each data point with a color that 
quantitatively and qualitatively reflects 
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the original experimental observa- 
tions. 11 Visual assimilation is then more 
intuitive. 

Data Archiving and Mining 

Ultimately, successful interpretation of 
gene profiling studies is likely to be de- 
pendent on the integration of experi- 
mental data with external information 
resources. As multiple experiments in- 
volving multiple cell types and tissues 
from multiple laboratory groups accu- 
mulate, data archiving may well be- 
come the watershed issue. Ideally, all 
data, in a suitably standardized form, 
would be freely accessible in the pub- 
lic domain. Even assuming a willing- 
ness. l.q share the data, such Utopian 
goals would require a user- friendly and 
powerful database system and stan- 
dardization of correction and normal- 
ization procedures such that data points 
from various projects become compa- 
rable. 55 The National Center for Bio- 
technology Information Entrez sys- 
tem (http://www.ncbi.nlm.nih.gov 
/Entrez/) does provide useful data in this 
regard, but current databases may be 
limited in scope or computability." A 
major focus of infrastructure develop- 
ment to support genomf c-scale expres- 
sion studies will need to be in the area 
of electronic biological pathway data- 
bases and resources. 

CONFIRMATIONAL STUDIES 

The development of more sophisti- 
cated analytical algorithms and data- 
bases will help lend credence to the bio- 
logical significance of differential gene 
expression determined by microarray 
analysis. In the meantime, several stud- 
ies have begun to examine the sensi- 
tivity and specificity of microarray- 
based experiments. Sensitivity, defined 
as the minimum reproducible signal de- 
tected by a given array scanning sys- 
tem, has been reported for microar- 
rays to be approximately 10 mRNA 
copies per cell, which is slightly infe- 
rior to the sensitivity of Northern blot 
analyses.* 57 Specificity studies showed 
that for a given probe any nontarget 
transcripts with more than 75% se- 
quence similarity may show cross- 



hybridization.* The problem of clone 
misidentiflcation and the need for clone 
confirmation have also been ad- 
dressed* One study found that of 1 1$9 
bacterial stock cultures, only 62.2% 
were uncontaminated and contained 
cDNA inserts that had significant se- 
quence identity with published data for 
the ordered clones. 5 * Thus, the use of 
sequence-verified clones for cDNA mi- 
croarray construction is warranted. 

Additionally, potential gene candi- 
dates can be assessed for relevance to 
disease using parallel technologies. Sev- 
eral such alternative platforms have 
been used to bolster the importance of 
specific sequences first suggested in 
gene chip comparisons including (1) 
methods at the RNA level, (2) meth- 
ods at the protein level, and (3) func- 
tional studies. 

Methods at the RNA tevel 

Reverse transcriptase polymerase chain 
reaction (RT-PCR) is a method often 
used to verify microarray data. Al- 
though RT-PCR is not well suited to 
quantitation, the relative technologi- 
cal case of this assay and the ability to 
rapidly monitor multiple samples make 
it a useful technology.* 0 * 61 Hybridiza- 
tion data can be verified and multiple 
putative markers can be screened in a 
short period. 

Several other studies have used real- 
time quantitative RT-PCR (TaqMan, 
PE Applied Biosystems, Foster City, 
Calif Real-time PCR is a tech- 
nique that increases the quantitative 
ability of RT-PCR by providing accu- 
rate and reproducible information 
on RNA copy number (Figure 3). in 
this method, a fluorogenic probe 
(labeled at the 5' end with a reporter 
fluorochrome and at the 3 f end 
with a quencher fluorochrome) is 
annealed to 1 strand of the target 
cDNA sequence between the forward 
and reverse PCR primers. As Taq poly- 
merase extends the forward primer, its 
intrinsic 5' to 3' nuclease activity dis- 
places and degrades the dual-labeled 
probe, releasing the reporter fluoro- 
chrome from the quencher label and 
allowing the detection of a fluorescent 
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signal that is proportional to the 
amount or PCR product generated in 
each cycle, 63 

Northern blot analysis is also com- 
monly used as a confirmational tech- 
nique, as it is a standard specific and 
semiquantitative method. 15 ' 1761 For 
rnRNA expressed at moderate-to-high 
levels, and for which cDNA probes are 
available, Northern blot analysis works 
well, but it is not well suited for low- 
copy rnRNA.** 5 Furthermore, only a 
small number of genes can be ana- 
lyzed with this conventional method. 

Methods at the Protein Level 

DNA micro array technology is limited to 
the study of gene expression at the 
rnRNA level However, it has been es- 
tablished that rnRNA levels do not nec- 
essarily correlate with protein levels. 
Moreover, the level of expression or even 
presence of a protein is not tightly linked 
to physiological consequences. An in- 
vestigation conducted by Winzeler et al* 
for example, provides a cautionary tale. 
Their study demonstrated that genes up- 
regulated in yeast growing in minimal 
medium did not prove to be more im- 
portant for growth than genes that were 
not unregulated." They found only 2 of 
8 genes required for yeast growth in 
minimal medium to be induced. The les- 
son to be learned is that genes that are 
not differentially expressed may be of 
equal functional importance in disease 
states compared with those that are. 

Furthermore, the regulation of some 
genes maybe at the translational rather 
than the transcriptional level, which 
would preclude detection by DNA mi- 
croarrays. Posttranslational modifica- 
tion of proteins is also an important 
mode of regulation that cannot be de- 
tected by DNA microarrays. Protein ac- 
tivity, particularly receptor activity, is 
heavily dependent on phosphoryla- 
tion, for example. DNA and rnRNA re- 
veal nothing about whether a given pro- 
tein is active, and can be deceptive when 
used to speculate about quantities of 
proteins, li has been demonstrated that 
the correlation between rnRNA and pro- 
tein abundance is less than 0.5, rtT em- 
phasizing that ideally, rnRNA expres- 
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sion studies should be accompanied by 
analyses at the protein level." Radio- 
immunoassay and immunohistochem- 
istry have been used in a number of 
studies."* 8 * 9 These techniques, how- 
ever, are not well suited to detecting low 
levels of expression, and they require 
the availability of an antibody specific 
for the protein to be studied. 

The field of proteomics, the large- 
scale parallel analysis of the proteins that 
are present in a cell, is developing rap- 
idly, but has problems of its own. Pro- 
teins vary in abundance by many orders 
of magnitude within a given cell, and 
there is no PCR equivalent for the ampli- 
fication of proteins. Moreover, proteins 
fold in many known (and unknown) 
ways that affect their function. The fea- 
sibility of the microarray analysis of pro- 
teins has begun to be explored. Antibod- 
ies attached to microarrays can be used 
to bind to and quantitatively detect pro- 
teins that have been tagged with fluo- 
rescent dyes. 10 Skeptics doubt the plau- 
sibility of identifying thousands of 
unknown proteins in this manner. 70 The 
diverse chemistry of various proteins 
poses serious difficulties, and it will be 
challenging to find antibodies for every 
protein. Thus, although it is important 
to incorporate protein analyses into 
expression profiling studies, current plat- 
forms are technically limiting. 

Functional Studies 

Confirming the role of a gene Initially 
identified in a microarray experiment in 
animal models with transgene or knock- 
out studies provides a particularly pow- 
erful alternative platform. Transcript 
function, rather than mere presence, is 
addressed. However, this approach is ill- 
suited for high-throughput conditions. 
U may be ideal for an in-depth investi- 
gation of 1 or 2 genes of interest, but it 
is not practical for confirming large quan- 
tities of profiling data. 

Confirmational studies are useful to 
corroborate the biological signifi- 
cance of differential gene expression de- 
termined by microarray analysis. While 
improved databases and more reliable 
statistical models will help to lend 
greater authority to array data, alter- 
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native platforms can be used to assess 
the relevance of genes first identified by 
array comparisons. It should be real- 
ized, however, that the alternative tech- 
nologies are not intended for large- 
scale analyses. Realistically, only 
selected sequences from the array data 
can be confirmed with other plat- 
forms In the short-term, a retreat from 
the initial purpose of the genome- 
scale investigation by microarray. 

CONCLUSIONS 

Microarrays can be expected to prove ex- 
tremely valuable as tools for the study of 
the genetic basis of complex diseases. The 
ability to measure expression profiles 
across entire genomes provides a level of 
Information not previously attainable. Al- 
though complicated issues must be re- 
solved, the potential payoff is big. Mi- 
croarrays make it possible to investigate 
differential gene expression in normal vs 
diseased tissue, in treated vs nontreated 
tissue, and in different stages during the 
natural course of a disease, all on a ge- 
nomic scale. Gene expression profiles 
may help to unlock the molecular basis 
of phenotype, response to treatment, and 
heterogeneity of disease. They may also 
help to define patterns of expression that 
will aid in diagnosis as well as define sus- 
ceptibility lod that may lead to the iden- 
tification of individuals at risk. Finally, 
as specific genes are identified and their 
functional roles in the development and 
course of disease are characterized, new 
targets for therapy should be identified. 

Despite the problems of defining nor- 
mal, understanding tissue heterogene- 
ity, making arrays comparable, analyz- 
ing and archiving massive quantities of 
data, and performing confirmational 
studies in alternative platforms, expres- 
sion profiling with microarrays stands 
as a truly revolutionary technology. As 
we continue to delve into the possibili- 
ties, we will surely progress in our un- 
derstanding of current issues and com- 
plications. No doubt the ride on the 
high-throughput highway will be ex- 
hilarating. 

Author Contribution: Ms King and DrSlnha both con- 
tributed to the conceptualization and wriUna of thk 
article. ° 
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Proteomics 



Kaifaryn Lilley, Azam Hazzaq and Michael J. Decry 



17.1 Introduction 

There are now numerous organisms whose genome sequences are known, for 
example Arabidopsis (Arabidopsis Genome Initiative, 2000), Drosophila (Adams 
etai 9 2000), human (International Human Genome Sequencing Consortium, 
2001) and rice (Yu et al., 2002). The prediction of open-reading frames from 
these genomic sequences has enabled the comprehensive identification of many 
putative protein sequences. These proteins can be arranged into three categories, 
namely those of known function, those with recognisable motifs and hence a 
vague idea of function, and those with no sequence similarities to any pro- 
tein (Gabor Miklos and Malenszka, 2001). Many proteins reside in this latter 
* functional vacuum' which could represent as much as 30% of the predicted 
proteins. Determining protein function is key to understanding cellular mecha- 
nism. Studying how protein expression is modulated in response to a given set 
of circumstances, such as infection, disease, developmental stage, senescence or 
response to drugs, will facilitate the elucidation of disease pathways and thus 
provide a mechanism for diagnosis and therapy. 

DNA chips (mRNA profiling studies) can contribute to the study of gene 
expression in response to a particular biological perturbation. However, the 
extrapolation that changes in transcript level will also result in corresponding 
changes in protein amount or activity cannot always be made. Tb understand 
fully, we need integrated data sets from a variety of protein expression studies, 
providing information on relative abundances, sub-cellular locations, protein 
complex formation and the profiling of isoforms generated by either alternate 
mRNA splicing or post-translational modifications. Proteomics is the word now 
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commonly used to describe the discipline associated with the acquisition of 
these data sets. 

17.2 Definitions and applications 

Before embarking on an overview of the techniques used in the field of pro- 
teomics, it is necessary to define a few terminologies; pioteomics, functional 
genomics, structural genomics and post-genomics are terms that have crept into 
the scientific vocabulary with alarming stealth and are used freely, and in some 
cases interchangeably, especially in cases where die use of such /buzz words* 
is likely to increase the attendance at departmental s££*inars. 

• Genome: A genome represents the entire DNA content in a particular cell, 
whether or not it is coding, non-coding, or is located either chromosomally 
or extra-chromosomally. 

• Genomics: Genomics is the study of the genome, interrogating the complete 
genome sequence using both DNA and RNA methodologies. 

• Proteome: The proteome represents the complete set of proteins encoded by 
the genome. 

• Proteomics: Proteomics is the study of the proteome and investigates die eel* 
hilar levels of all the isofonns and post-translational modifications of proteins 
that are encoded by the genome of that cell under a given set of circumstances. 

• Functional genomics: This is the study of the functions of genes and their 
inter-relationships. 

Whilst a genome is more or less static, the protein levels in a particular cell can 
change dramatically as genes get turned on and off during the cell's response to 
its environment The proteome originally was defined over seven years ago as 
'all the proteins coded by the genome of an organism' (Wasinger et al. t 1995). 
Nowadays the term 'proteomics* is used more widely and implies an effort to 
link structure to function by whatever means are appropriate. We can expect the 
definition to change again with time as the field and the investigator's view of 
it evolves. 

17 J Stages in proteome analysis 

Proteomic analysis can ascertain function either by looking for changes in the 
expression of either all or a subset of proteins, or by identifying binding partners 
for particular proteins and seeing how their interaction is affected by biological 
perturbation. Whatever the rationale of the investigation, or the number of pro- 
teins involved, the study of the proteome can be broken down into the following 
stages of analysis. 
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Abstract Novel and powerful techno logtes such ai DNA 
mleroarrays and srobtomJcf bora mads possible the aoilyfb of 

'the cxprcsdoa tcvffti of matt^le esses tlntufff "^iwffr both Id 
bcaUb en^tisense. Id cemhhietron, csese rnehnotogfci prembeto 

medicine so they km* expected to reveal gtnt rcgufeta crests 
btohed to disease prc^reshto ts«tOnt> olopotM patrona! 
tercet* few dreg dfecorcry cat 4Ngnrwttm Hare, wo mtew tat 
csrnai statns of these techno lagics end Mgbl&l ooan studies to 

tiles. Published by EtoH*? rlgtts itui 

Aey writ; Gene expiation profile;. Mlcmrnw; Protectee; 
TwcHSiratssioaej gel* electrophoresis; liBmtuiobistoctemiitiy; 
Bladder cancer 



1* fanrodnetfoo 

It b now e little more then 10 yean since the Knmao 
Genome Project was bunched and during Ihb relatively abort 
period ef timt cheic have beta rccoartabm entrances to the 
oooftntedon of physical and genetic maps as vefl aa to the 
identification of genes associated with homo domes (1,2). 
Soon, the total sequence of the human genome will be ded. 
pbrred and, bopefuBy, made available to researchers world* 
wide for the beneSt of najikmd. 

Undoubtedly, die Human Oeaone Ptoject hat paved the 
way to the revolution to the hie sciences that we ore tapeti- 
ceciag today. Gradually, however, its focus b starting to shift 
towards functional genomics, oe area of the posvgeaomk cm 
that deals with the functional anaJysb of genes and their 
products (sec the article by Oofieau to this issue). Techniques 
of functional genomics include methods for grcc a predion 
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profiling at the transcript (DNA mtcroamys |>S|: ace also 
vtide by Brums and Vitoto thb bsve); differential display 
(6); serial aaatycb gens c ap imi uu p-9j, and protein leveb 
(pioteotaics) (1 10-13); sec also o/tfete by Andersen tod Mean 
to thb ssmaX m e*J) ju truugetitos |I3*. phage display |l«l 
precedence for studying pronto pmitia totcsmeticms 415,16); 
see aso artide .by Ugram and Sclig to this bsue) and bioto- 
fomiatUi f}Tl 

Among the technique* of ftotcsioagJ geooaics. both DNA 
caieffosfffftyi qy-S\ ami letotna thereto) and pro tc^^ 
12) held great ptombe for the study of oomptai btotofjca! 
•yatems with appCcotsata to footocutax medicine. These novel 
tad powtfful gene expression profifing cechniqnei perrnit the 
aBalysig of the cjpjwiua levels of thousands of genes samfl- 
tsncously both to health and disesxe. These tecsntdogbo are 
ccj Tn| i b i MUlaj y. aDow hign^hjocu^minv and to coifiMnottoo 
•re capocted to gesstrste o vast sunount of gene end protein 
ii|iiniT A*** A»t t^i « T > ^irTfrrTHrrrg rf ihr 
rsgutotory events hrvohed to oortoal end disease proouscs. In 
s^dsuon, these technologies offer • systematic approach' for 
searching for efTectht targets for drug discovery tod diagnos- 
tics. 

Here, we sevfanv the current state of DNA wlauaiia ys and 
protewerics and highlight some studies to which they have 
been applied to concert to the analysis of biopsy frf """f 

X DNAmtooarreys 

The amount ef information that b now becoe nbg available 
to researchers to the Use iriinm b cxptodtog, and cveo 
though the data can ho stored to eoaventtoaal medto, new 
inemods art being required to anafysa large seu of genes to 
a high-throughput fashtoa. For thb purpose the DNA array 
taehnology was deveJopeet The method mako it possibb to 
survey thousands of genes in parotid, and has several areas of 
eppbation. Oat b caprcsston coonrtortog (It), to which the 
tiaoscnpt leveb of genes are rneacnired to differeni physiology 
tea) condittoos both in cultured oefls and tissues, to search tor 
regulatory cipvession patterns. Understanding patterns of ex- 
pressed gena b capected to improve our knowledge of highly 
complex, networks that cross eoawtunioite to hilhcrto un* 
known ways, both to .health aod disease. Another area of 
application b polymorphism analysts |I9> In this case* poly- 
morphic regions of the genome are scanned to search for link- 
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age to diseases, and 10 rival disease suioeptibility genes andf 
or inherited disease genes. A simitar approach has bees used 
to analyze polymorphic regions of known gene* in p%itrctfnr 
to determine whether polymorphisms are associated with en 
altered function of the gene product, a fact that may i&crease 
the ruseeptibOiry to disease. Finally, various otiempts have 
been made to utilise DNA arrays for seq renting (20). * 

Below we renew the technology currently to tuc for raitro- 
array-georraicd gene cxprcstoo pattern discovery as veil as 
some applications. Table 1 provides a set of World Wide Web 
rites that contain useful additional mfotanatioa (sec also the 
article by Branna and Vito ia this issue). 

Z, /. MizromrCy tec&notogy 

Microarrayi are usual*/ made by deposition or DNA spots 
on a solid support like a coated glass surface, that differs m 
. several ways from conventional filter-based aupporta such as 
charged oyton and nitroeeOulose. The flatness of the g*»?t 
surface makes it possible (i) to amy molecule* n ■ parallel 
fashioo. 0>) to nuniaturoe the procedure and (iii) to use flu*, 
rcsccat dyes for detection. There is no diffusion of the applied 
material into the rapport, thus allowing focusing for laser 
scanning microscopy. 

Two main procedures have been used to produce DNA 
chips: photolithography as developed and naifcctcd prinaxQy 
by Affrntetrix Inc. (Santa Clara, CA, USA) PCUIJ and on> 
. chanhal grjddmg (22) Pnotoiithoevuphy b wtfl known in the 
computer chip industry and utilises an ultraviolet light somas 
that passes thioogh a mask that directs In a itep-wbetoamxr 
whete a photochemical reaction <oUgooudeotide synthesis) 
takes place on a sfficonised glass surface. The mask can be 
piuduoed with openings as small as a few micrometers aOew- 
big a density of several hundred ,K nmrr1 probes per square 
ce n ti met fw of glass. There b, however, an inltcftnt length n> 
■tnctfon with this in situ synthesis technology Gcafttag lbs 
probes to about 25 oucfeotida in length. H*s>4cnsity arrays, 
on the other hand, allow the use of multiple probes per gene 
poj. 

Mechanical gridding methods are based oa rnk-jet or phys- 
ical deposition of the material using pins manufactured with 
very nigh prccisioo. There is direct surface contact and the 
transport of small amounts of bsjuk) makes these systems 
susceptible to evaporation and oontaatinatioa with dost par* 
tides. The grinding instruments use an XYZ motion control 
based oa sup engines that can- be controlled with very high 
precision. The DNA containing material can he spotted from 
96 or 3M weD plates to glass to predefined patterns. 

The arrayed probes can be oKgos (photolithography and 
asiddmg) or cDNAs (gridding). The hybrid nation reaction 
conditions are quite Afferent in the two eases, and special 
sample pi* pari dons am needed to optimally utilise them 
probes. With shorter probes, Le. of 20-30 nucleotides, the 
sample* is fragmented to avoid tertiary structures and to 
achieve optimal bybttaiabon (18,201. Polymerase chain reac- 
tion (PCR) amplified probes of 300-2000 nucleotides usually 
do not require fragmentation of the sample (23). 

The type nf glass used as support <hc coating substance, 
the coupling technique, the labelling system, and the thtores* 
cent labeb used for detection ore all variables that must be 
optimized. A number of coating substances are commercially 
avaibbte to immobilise DNA to the surface. Two types of 
attdes arc available that use a coating procedure designed 



for printing of amine modified DNA: these include (9 sir/* 
tatcd slides, which contain reactive aldehyde groups that react 
with anuoo-gioups via a Schiff base formation as the printed 
DNA dries oa the surface of the slide (avaimbfe from Tele- 
chem Inc. CA. USA; Call Associates toe, TX, USA; for 
coupling cnemistry sec e^Avww^rmyitxernm^ 
coupling/) and fo) activated slides prepared by the oovalent 
attachmem of a bydrophilic, polymeric amine reactive costing 
to sitene base-coated sfides as described by Brier ncd Homnsel 
P4) (available from Soraaediea Inc. MS, USA). Amine-modi- 
nad DNA attaches covalently to the activated, polynirric rur* 
face. 

Three other types of slides arc available that are based on 
more conventional mnnchflbatien technology ir/trtintty used 
in mctobrane tanxocbiioation: these mcfude (i) rfJaahed slides 
which carry cevalcouy attached primary amines on the sur-. 
fane that can form tonic bonds with the phosphate bsth bo uc 
at neutral pH. In aattitien, tfte rtdfcs^4»^couphng la neon 
thymidine rcfttucs on the DNA and carbons on the alkyt 
amine of the sabstretc can be induced with UV or hcat<(pro~ 
need by omnpasnm Ske Tctnehem ln&. CA. USA; Stssna 
Ablr^lnc^a^ fac. v NY. USAK (b*) nitro- 

aad mvnobffissUton propeitits of ntocccJrataw that 'btodS 
DNA in a noa-covesnst but snevcrsibfc msxaur (Schleicher 
and Schueu% and (Si) ssefy4yau»>«oatcd slides which feejoiie 
UV cnm&nking of the DNA (available from Sigma AMrfch 
toe. Ma USA; see oho e*tp /fr mgm ann f ofdbe dcV bbs uw n / 
pmocols/LjIhlefccttmJ). 

1J. QuanrUattni fAr Jfnab/rtm amyj 

A linear response that covers two or three orders of mag- 
nitude to often needed to detect tow and high copy number 
transcripts on the same array. In cases where this is not pea- 
sibie it may be necessary to scan the chip at different wave* 

top of the bound sample (23). Ia the latter cava, the first 
scanning ti carried out niter hybridization of the labelled sam- 
ple, and the second after reaction with the tshefled antibodies. 
It Is fvacessaiy to dotummt the hnearfty and ccgroduciMllty 

to probe to obtain reliable data. Often, a sta^etesd sample b 
used to compare wfth the experimental sanrpte cad this may 
crnnprntate for differences in hybridamtion from probe to 
probe. 

2*3. StwutsHSgBSfoii 

Comparison of data obtained from j n d i p fpd r.n l arrays and 
from different laboratories reqttires stajtdare%ation. Both the 
AfTymetrix chips and the custnm made cDNA chip* use differ* 
cut cnslhods for standardissttoa. The Affymeuis chips have 
. approximately 20 probes per gene and stainlarotetion is ei* 
thcr based on the cscmsma bvel of ccf acted grnca, like actin 
and OAPDH, or oa a setting of the global chip irttensity to 
spprojumatdy ISO units per gene on the chip, la this way, 
chip data from d^ereni cspejuiiuiis can be compared to each 
other. In our hands, the data obtained with the two standard- 
cation methods differ only by approximately 10% (unpub- 
lished obscrnitioas). 

'The custoavmade cDNA or oligo arrays also require ctan- 
dardemtioa, hot this is a complex problem, fo general, the 
standard used often reflects the purpose for which the array ' 
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was produced. For example, for aprcssion mentoring of 
breast cancer cells, a mixture of breast cancer edl Uses nay 
provide a good standard (26). Today, however, time is oo 
golden standard that can be used for all purposes and as a 
result, it is difficult to compare data from different laborato- 
ries and often it is accessary to use other technologies such as 
Northern hybridization, rcaHime PCR or ixnmunostainiog to 
validate the signals. A ninimum rcqvirancBt b that boors- 
tones that produce arrays themselves should be abtc to repro- 
duce data from ooc chip generation to the next based on the 
analysis of wdldefioed controls covering different genes and 
espressos levels. In addition, it is common to use spiking of 
samplei with bacterial genes that hybridise to probes, epotted 
for control purpose on the arrays. 

2.4. Semptu for expression monitoring 

The analysts of retaoVrly homogeneous ceO populations 
(doited cell lines, yeast, etc) has proven much simpler than 
the analysis of tissue biopsies as the latter often contain many 
cell types (cpiihcbaJ, endothelial, inflammatory, nerve, muscm, 
and connective dstuc ecus) that are present m variable 
amounts. Standardization may require microdaxection of. 
the tissue to isolate specific cell types [Z7.2ol although the 
oumber or ecus needed for the assay is wcD above a million. 
Sampling of specific cell types using bucr capture ~ 5 "^ t rnrr- 
tion (LOO PQ can be a tmwconsumlng task, pad ttreo that 
mftNA b prone to degradation the pr oc easin g time must be 
kept to a minimum If only a small amount of snaterial is 
available, then a reverse tramxription-PCIl step la i 
for amplification, but this adds an addhional 
due to the lack of linear ampfificnUon of all transcripts. In 
one of our laboratories we have used prcparatwas of tingle 
cetl types from tumor biopsies to standardise poofing of sam- 
ples for generating profiles of gene expression at different 
stages of tumor development (manuscript in preparation). 

2 J. Bioinfomuttlc anatyji* of expression data 

2.5./. Mererdttml duller olgortiPuni Hierarchical duster* 
tog algorithms can be divided into two types: egsjlomcrative • 
and divisive (Fig. I> (50). The aggjomcrebvc method b a hot* 
tonvup approach, where the algorithm starts with n separate 
crosiers (for eaampte 4000 sjeoes, where a a 4000) and scaecs* 
sively combines dusters until only one b left The divisive 
method, on the other band, b a top-down approach starting 
with one duster and succcssivdy rptittmg clusters to produce 
others. The algorithm used to form the dusters must also be 
defined; two widely used and simple algorithms are the single 
linkage and average linkage methods* respectively. Single fish* 
age, alto called nearest neighbor, defines the distance between 
two clusters as the minimum distance over aD pairs of dusters. 
Average linkage takes in consideration the average distance 
ova all [30]. 

A distance matrix must be calculated before clustering b 
performed, and it b the distances between the obtained gene 
expression profiles that arc used to form the actual clusters 
PO). Observations with small distances arc grouped together 
as described above The two most commonly used distance 
measurements are the Eudideaa distance end the Pearson 
correlation coefficient. 

Euclidean distance : ED '*» ^{X*—XpP 
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Fig I. Two main typo of chsUrcig algorithms, the btrrarducal 
and the nos-AusejchaaJ algorithms, 



r^carion correlation coefficient : 



*o b the mcasuicsncot for the ath variable oo sampling unit I 
y* b the variance of the Alb. variable (30,31). 

2.SJ. HothhUrmthkmi chattr algorithms. In oon-hier- 
archicat duster analysis It b assumed that the data can be 
divided into a certain number of clusters and thai they are 
well asperated The advantage of this approach b that targe 
dtta seta can beciustcrcd much faster than by using rtfcrarchi* 
ca^dunering because *}*^ Pu n^ of dusttr t fa assigned 

yds b fcenenns. However, n snsthod termed cslfergnalsed 
maps ($OM) has rooently been appbed to caprctsroa data 
g rnemte d from DMA chip arrays (32*33). The fcmeans meth- 
od (34J identifies ft points that function as cluster centers. 
Each data point b then assigned to one of these centers in a 
way that mini i n i iT s the sum of the disfanrrs between aD 
points and their centers. Thus, it b the distribution of points 
that dodoes the value of the means. One dmwback of this 
method b that -a specific siueober of dusters is assigned, aa 
the number of ctnatcra b usually uDhsowa in targe dan 
sets. SOM b dmftar to the fccneenj approach, but it baa a 
gecau tries! configuration and the number of ooda pmdiunrs 
this conflgumnon. Inttally, dam points are mapped onto the 
geometrical cxmfiguredon. When dustering the data with 
SOM the position of n node migrates to fit the data points 
during iuuuual v e iterations (32J3). 

2.XS. SuporMsed etossyietttlB*, Cbmsnon to the clustering 
mctbodi in which array data ate used b that they am unsu- 
pervised, Le. no pjnrfiflnid references are tmowo. An aftcma* 
tive option b to osmatruct n supervised class ihcntioo method 
that requires at least two references. Per cane 
for example, the references could be the gene 
film from normal and mvarivc tumor tissue. In tab pirtrmltr 
case, a vector ftpj entitling gene expression over #• genes can be 
used to describe each tissue pj JQ. to a recent aruete by 
Oorab and coworkers (30), the authors analyaed 6117 genes 
using 38 bene marrow samples. Based on these 38 samples 
they found that o vector based on 10 and 200 genes was. 
sufficient to distinguish between acute myeloid leukemia and' 
acute lymphoblastic leukemia. Thus* the authors were able to 
oanstruet a cancer dassifier based oo a low number of genes. 
Using a similar vector-bused classifier approach. Brown and 
coworkers p7) analysed 2467 yeast genes in 79 discreet ex- 
periments and were able to classify genes into functions! cat- 
egories based on the cs^fcssroo data from DNA chip arrays. 
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2.6. Applications ofaneps fir cjpwswn monitoring 

One of the maia areas for array application it in thi omul 
tantous monitoring of thousands of transcripts io different 
biological sdtiogs. The approach b bong used to identify 
new networks ond to' understand patterns of expressed genes. 
A number of articles have been published using the amy 
technology aiming at identifying disease-associated alterations 
in humans. For this purpose clinical samples, human cell tines, 
and m a few cases animal models of human disease have been 
used. 

2.61. Gen* tMprestian prcfihng of ttuvt biopsies, Cdl tint 
and animal madgts pf disease. 2.6 J. I. Thsvg btopsUx Gene 
aprtssion studies cm clinical sample* have beta pefformod m 
breast and colon cancer (26.31), as well as in atherosderosis 
P°.40). Genes of presumably known functions have been 
identified and linked to the diseases; most of these data arc 
now evanablc in the Internet (Table )). In a study of breast 
cancer that used dirtka) tpecimem and cell lines, with aa 
amy coataioiag approximately 5000 genes, Fcroe and col- 
leagues (26) identified a proliferation related gene cluster b 
the cell line that was upicgulated in the more agarcssive din* 
tea) breast tumor r p crfmra s (26). la a similar study of colon 
cancer based on the AffynsttrU mars. 41 ESTi totsesogeua 
to ribosoaal proteins west found to be unregulated b the 
tumor tissue (31). In this study, a muscle tndea was used for 
enrrcetiug tor the svomal cwmponenta as this showed a high 
bdea m the norrnai biopsies. Both the breast and the cobo 
cancer studies lacked a correlation between gene eapjutiou 
levels in the cell lines and in the dinscal specuncaa, 

Human athtmderesb lesions from arterb carets an pled 
from patients undergoing surgery have also been analyzed 
ustog the AJtyrnetria expression arrays (40). Oat mportant 
finding was a five-fold tmregulatioh of the carry growth re- 
sponse gene Egr-I. a DNA binding protein that bfricnoes the 
tnuscripn'on of genes erieoding growth factors, cytokines, ad- 
hesion motecuks, and protrins related to coagulation. This 
finding was cortafaorated by onjnonohestocheeaistry and ani- 
«d espcnnwBU, and identifted Egr-I aa a possible target for 
thempeuttc intervention. 

26.12 CHI ha&r. Several studies based on expression 
mcfflhofing have been performed m human cdl lines b areas 
as diverse as cancer (411. ophthalmology and the central 
nervous system ftt). Arrays have also been used to study the 
effect of cytokines (interferons) (44], cyioniegalovints ufection 
(4*1 oml oaGogcnc tranafecsion (46] on the overall patterns of 
geat cspressten. In one study, human foreskin fibroblasts 
were in/octed with human cytomegalovirus and the csxsessioa 
of •pproiiraatdy 6000 genes was monitored for up to-24 h 
(45)1 A total of 238 gracs was found to be unregulated more 
than four-fold these included HLA-B (upiegulated eU-fotdX 



a protein that protaeu against cytotoxic T-rympbocyto; ROI 
SSA (52 kDs protein mRNA; unregulated 12-foldX • com- 
monly targeted autoanUgta, aswtOas several compoaeota of 
the pathway that produce prosugbndin E2 from arachidonic 
add. Io another study, it was shown that ticatenent of the 
human fibrosarcoma cdl fine HT 1080 with IFNs o, f and 
7 resulted in the utmta^itation of novel genes implicated to 
apeptosis (RAP46, Bar I, scramblase), while genes like 
IGF-2 and ZnT-3 were slrongr/ dewnregulated (44). 

The effect on gene expression of the f usio o oocogene PAX3- 
FXHR transfected Into NIH 3TJ cdls has nho been studied 
ususnhrarreya (461 U was shown that the fusion gene, but 
not the wild type control was abb to activate a myogenic 
trefiacriptieo program that iachrded inductioo of a number 
of transcription factors such as MyoD. myvgenin, SUI, and 
stun. 

J.&/JI Animal moaW sjatcmr. A common approach to 
the study of human cfteascs is to use animal modd systems. 
Ibis has been dose for a range of diseases Intruding enoepha- 
hrayetopathy, ryrnphema, renal tubuli (23), and hug fibrosis 
(4*3- Io the tatter ease, AJffymctrU chips were probed with 
pooled samples obtained from group of set anneals to rcduos 
variations and cost. Treascripta that were dssiatanUy altered 
b rung fibroin (47) befoded extrsnOuhtf rnatrbi and bfian* 
matory response genes. Analysts of dusters t-^i these 
gfacs b a time course experiment with the SFOTTIKB FRO 
9.0 program revealed duTercol tempera) patterns of upiuslon 
that farther subdivided these genes. 

2.6JL Tortcalotr md &rvg usttnj. From a tosioologrcal 
point of view, there are great upcetanona for caprcssieo mon- 
itoring as the effect of dross, both expected and unforeseen 
aide effects, can be. caonitored to animals and cvtsniaOy to 
humans (48,49). One prottcsn that has arises from these 
to quite snaple modd systems, b the often- 
chaftgrs b transcript revets observed. These 
I are suite reproducible, snggestag • couch snore cons* 
plea rebtbnshlp among gene products than p r ev iousl y 
thought (49). 

The oombbadoo of gene esprcsaieo rnonitoring and testing 
of drugs on cdl Unas and in animal models boats arcat prem- 
ise. A recent rajtthatiOB showed that the variation b the liver 
csyseasion of genes encedmg aeseenotsc mrtahohsiag ess- 
tymes, grumthione sv^ubuors, DNA repair ensymea, heal 
shock proteins and housekeeping genes b larger among inoV 
vsdual animals than that mtfeducad by the array assay itsdf 
(49). These studies revealed the uprca^ilatien or cytochromes 
even at low doses of tVnapsttto&veno tmttnent and showed 
a good correlation between the array and Northern hybruli- 
ratton data. A reoectt screening usisg cDHA rnicroarrays of 60 
human cdl tinea used by the National Gaoccr Institute for 
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drug discovery, has formed the basis for csub&biog a data- 
base (bat cao be ticked to gesc expression data and molecular 
pharmacology |4 1 .50). 

2.6.1 AfottaJar claxsificait&n ofdittasti Qassficattoo of 
diseases cao generally be achieved by class discovery and dan 
prediction |J6). Class discovery refer* to the identification, 
based on gene eapressioo, of previously unrecognized sub- 
types of the disease. If clinical follow-up material a arailable 
these classes can be related to signs aad symptoms* disease 
course, treatment outcome, and mortality. Qtss prediction 
refers to the ability to assign a particular patient to aa already 
defined class, based on molecular aaminatiob of the diseased 
tissue or other sptcurtens. 

Two recent examples have demonsuated the power of ex* 
presrioo arrays to classify btmatologfcal malignancies. In one 
of these studies, leukemias vera classified aa A ML tad ALL* 
respectively, based on 50 genes selected from an array carry- 
ing 6tlT genes, lo this study, 36 oat of 38 patients were 
correctly dosafied, and two ware uncertain. The 30 genes 
used m the array included some that were known to dbTcr 
between AML and ALL, as wed as new markers (36). 

Another study classified B-eel) l ymph o mas into two motec* 
ularly distinct cfesscs that reflected different stages or eVccfl 
- differentiation, the gtrcsiaal ccntcr-tikc and activated BkkD 
tike lymphoma groups, respectively. For this study the lym- 
ph ochJp tntcroarray from Research Omrtfra, which hoMi 
17156 genes, was used. Follow-up for 12 yean showed a rig- 
tdficaat difference in survival tmong these groups |51|. 

li b envisaged that global surveys of geoa expression wtH 
identify marker -genes that, may be oxed to group patients loto 
motoculxrty retcvaot categories; these rnaxkers are expected to 
greatly improve the precmon and power of clinical triab; 

77. Conthsfm 

There is no doubt that the massive parallel gesc expression 
• iafomatfea generated by rnkroarrmys wiD have a major mv 
pact in the d isc ov er y and uodmtxntting of patterns of ex- 
preased pees. In addition, the technology is apectcd to gen- 
era tie Dovd and effective targets for drug discovery and 
provide, in combiaatido with protcomies, valuable toob lor 
the entire process of drug dn* lepra en 1 nod evaluation. 

One of the main challenges we foresee in the future wQI be 
to solve the problems posed by the analysis, interpretation 
and access to the targe amount of information that will be 
generated. Large studies of ESTs have oot been published yet 
as dam analysis requires the devdopment of new btomfor* 
ma be toob that can deal with the' huge amount of informa- 
tion thai is being created; These studies arc expected to iden- 
tify new genes of irnportartce to specific biotogrcal processes 
and reveal new regulatory pathways through the analysis of 
the expression levds of individual BST*s in huge ownbera of 
samples. 

lo u\e future, coupling of expression monitoring to trans- 
genic animal modtb may prove to be quite rewarding as the 
global effect of a gene knock-out or knock-in can he moni- 
tored with both micros nays end protcomies tools |I0| 

3. Preteomlca 

A complcToenury technology to DNA mieroarrays for 
monitoring gent expression is provided by protcomies. a 
term generally used to eacapsulstt aO of the technology cur* 



really available to analyze global patterns of gene expression 
ai the protein revet Proternj art frequently the functional 
molecules and, therefore, the most likely to reflect differences 
in gent expression. Genes may be present, they may be caw 
tetcd, but they arc not necessarily transcribed. Some messrsv 
gars are transcribed but oot tranihtcd. and the number of 
mRtf A copies docs not necessarily reflect the number of func- 
tional protein reofocuftj (52). In addition, protcornics edV 
dresso problems that cannot be opprcocfaed by DNA analy- 
sis, muetty, relative abundance of the protein product, post* 
transtiiioaal rectification, lubcdlular localisation, turnover, 
mtstactton with other proteins as wdl u functional aspect*. 

Tot proteome has been defined by WHkins aad colleagues 
as the uuDpHte set of psotates encoded by the gessoes* (531 
and recto Uy, the teres has been broadened to include the set 
of proteins apressed both bi space and taxu. There are two 
main approaches to ps oteo mfca : one is the expression foodd 
in which sH proteins ore analysed, and the other b the cell 
map model io which only a selected an of proteins, like com- 
plexes and orgsusata, arc studied (54). 

The protaomlc technology b complex, and cornprises o 
plethora of stats of the art tiTfmrqiTrs to reserve (high reso- 
hitioa rwo-4uxus3sioual gets), quantbate (phc^peurrmager, 
sptcbl scanners), identify ancVcbaractxrixe proteins (mmrose- 
qucmingv mam spectrometry), aa wall aa to store (twe^dfescn- 
deaal r ^l y xc r y tamide gri deebvpaceesb (2D PAGE) dam* 
basts; txtp'Jtotebuddkft&Wstib; http^capajyJbciigc^h/ 
sprot/cprot*top^tsnO reimmunkotc and mtcronk protein and 
DNA sequence and snapping tnfotnudoD Ibiouaforaaatks) 

po^s-sq 

The ngturarn of the 2D PAGE technique for ksrge-sceie 
proteortBC projects depend s very much on the number of pro* 
Jcins Chat can be resolved in a complex protein mixture, for 
example a smmao cefl. rVeteomc profile date from a few tab- 
oratories, bduding one of our own, have fcactated thai only 
a fr action of the burnau gents are switched on to a given cell 
type tnd extensive ensJyib of whole orO extracts, organelles as 
well as partially purified subcellular Treasons, suggest that 
individual edts may oot express more thao 6000 primary 
transtaiion pr ed nt bi \S9\. To this number cms baa to add 
the post*Cfi n s l a tf one ! processing and chem i ca l caodiacntbsna 
fjihosphorytation, gfyeceymttoo, dcBBCthytaUon, ecctylatioo, 
myristoybtrao, palmltoybtioa, sulfation, umquhmtims, etcX 
the latter being rather common and cetenxfve in many pro* 
edas [60,61). Thus, as we stand today the 2D FAOB technol- 
ogy is ooi abb to resolve nod depict b a single gel all of the 
proteins thought to he present In a csarasraliaa cdL 

Currently, there b a greet deal of interest fa pr o tnum io as 
cppbcstlons of tluj technology era ctpoctcd to srveal gent 
regulatjcn events involved fas dbease uiugiiisfon as wefl as 
potential taigtts for drug discovery aod dJagnosttcs. More- 
over, the ochnotogy b bound to have a great Impact m agrv 
culture, lexicology and the todustry in general. 

3.1. Ptottia ttparctiom: lAr 2D PAGE tethtehgy 

For the past 24 yean, high resolution 2D PAGE has been 
the teehaiaoe of choice for analysing the protein composition 
of celts, tissues aod fluids, as wtO as for studying changes to 
global patterns of gent expression dicHed by a wide array of 
effectors (12,62-64). The techoiqun, which was originally de- 
scribed by OTarrcO (63.66) and Kfott (67), separates proteins 
both in terms of their isoelectric pemt (pi) and omroeular 
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weight and provides ike highest resolution for protein snajy. 
ri*. Usually, one chooses a condition of interest, for rsimplr 
the addition of scrum to noa-differcotiatcd human .keiati&O- 
cytcs, or compare oonaaJ and transformed ocfls (Fig, 2A,B), 
and let the oefb reveal the global protein response as all de- 
tected protons cao be analysed both qualitatively (poit-trens- 
laiional modifications) and quaatiutnrcty (relative abundance, 
co-ordinated expression. Fig. 2C) to relation to each other 
Q 55.68J and references Otcrcio; see also http://Mobase.dk/ 
cp-bin/otUs). 

For many yean the 2D PAGE technology relied on the use 
of canter ampholytes (amphoteric compounds) to establish 
the pH gradient, but this technique has proven to be difficult 
because of the lack of reproducibility created by uncontrcl- 
table variations to the batches of ampholytes used to generate 
the pH gradients. Lately, however, with the rotroduotioo of 
omnobilizcd pH gradients (IPOs) (69.70]. which are integral 
pan of the polyacrylamide matrix, it has been possible to 
obtain focusing patterns that' can be cnsOy reproduced by 
the norxxpert. IPGs avoid some of the problems associated 



with carrier ampholytes gosh as caihncue drift i 
sis, allow a higher loading capacity for a 
and provide tacrcasod charge rcsorotton when narrow pff era* 
dieats (QjOJ pH uniifcm) are used (I7D.7I] and references 
therein} In our hands, however, carrier amphorytts Q.S-10, 
Fig 1A) and broad range IPOs (Fig. IB) resolve similar num- 
ber of ("SJnsctbioiraic tabcDcd iw i yusptidu (about 2300) as 
tUusuated with the cenaratroa of whole protein csrtracts from 
bbc&od human UratSBoeytes (72). It hag been p/opond thai 
narrow range* overlapping IPO gradients viewed aide-br-slde 
may provide a aolutioo to the probtem of resolving and de- 
picting the proteomc of a given ceil type. Recently, however, 
Oorthab and co-workers found this solution tmrcatistk; as ti 
wED require the running of a hup oumber of gels (711 

Very basic p olypepti d es hare proven difficult to resolve, 
attboogh both carrier ampholytes (non^uiTibrium pH gra- 
dient cJecuophoiesis, NEPHOE) (66,73] and IPOs 9-12 and 
4-12) Q70] and rcTcrea.oa thcrcio) have been shown to sepa- 
rate basic proteins. 

One of the most important steps io the 2D PAGE techno*- 
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ogy concerns sample preparation as very often some proteins 
cannot be p i opcrry dissolved by the ryab solution originally 
developed by OTerreU (65). Thus, there U pressing oeed to 
develop protocoU for optiroims sample sorubuixatioD, To- 
wards this turn, Rabilloud and oo'workcss have node us* of 
the high loading capacity of IPOs to resolve tmmsbrane pro- 
tein* for structural analysis* and in dotnj so have unproved 
their solubility by uxfng a combmiTion of detergents sad eba- 
otropes |74,75). It has bam shown that the addition of thio- 
urea, CHAPS and snlfobctaia suffafltaats to the lysis solution 
containing urea results in a much improved solubiuaatioo as 
wdJ as transfer to the second d uatasion SOS gel As far as 
nadcar protciaj are concerned, OArg aad colleagues have 
improved cxmriderably the anarttBCn of very bade proteins 
by first predpiutiag the samples with acetone prior to sofubi- 
(nation fo the lysis sotatioa (70). The problem associated 
with the extraction of tissue samples, on the other hand, an 
much more complex and have not been addrexxed yet ta a 
systematic fashion. 

31.1. Detetticn. An important limitation of the 2D 
PAOB technology b the lack of very sensitive procedDrcs to 
detect those proteins that are present in very tow abundance. 
In addition, detection procedures are needed that eaa be ap- 
plied to a targe number of reserved proteins whose abundance 
may span through seven or eigbi orders of nugnirude. 
Clearly, the sensitivity of sUvcr nitrate and Coomaswfi Blue 
staining is inadequate, aad only snetabobc labelling with spe- 
cific isotopes may reveal enough proteins to warrant protco 
mic project!. Furthermore, the use of pfcosphor-iinagiag tach* 
oology may enhance the sensitivity and uneasily of detection. 
Limitations of the raeVolabdtiag approach include (5) tack of 
labtlltag of some proteias due to tow turnover, (II) problems 
assecbtcdwitb safety regulations and disposal, and (ui) diffi- 
culties in obtaining fresh human biopsy material for labelling 
experiments. Ideally, one would like to have a highly sensitive 
fluorescence- based protein detection technique able to support 
all types of studies irrespective of the sample, or the cad point 
of the analysis. Preferably, the dye should not alter the mo- 
lecular weight and p/ of the proteins if it b to be added prior 
to deorophoresis, and should support quantitative studies 



involving proteins having extreme differences in their copy 
cumbers. Uafoftaantdty, bo such ideal dye is available on 
the ih ai hsi yet, a ltho ugh Oxford OrycoSornct has developed 
fciut e uj e ui IPO-MOB (htrp:/hs^^gxxomfpToteonc/ 
torabtau% a trchrtotngy not available to the srirnttftr com- - 
niuaity. Ruorescenoc opmpouncts such as SYPRO Orange, 
SYPRO Red aad SYPRO Ruby have bees used to anarync 
whole protein rysates from bacterial aad "**--"»""» ceOs, but 
their sensitivity (1-2 ng} b slightly lower than that of stiver 
nitrate (76,77], Sent advantages over siher staining tnetude 
short staining tone aad the fact that the gets do not need to be 
fixed prior to staining, In addition. Bab or no *-*-g b 
recjutrcd. 

For tow n hon da h na proteins of known cdcatHy, cfc tnui o a 
dots aot pose a problem as Gels and on>worfccrs have shown 
that 2D 'PAOB unrnunoMottiag in eombinstioa with en- 
hanced chcnuTumioesccnce (ECL) can detect as utile as 100- 
300 mntfrntrs per cell in unfractionated ecOuhv extracts (60). 

$JX QtaatttttUm, Even though there arc several toob 
available for the quantitation of protein spots, there b at 
present no available procedure for quantiutixig aB of the pro- 
teias resolved in a complex mix turn. Part of the problem Gas 
la the huge dynuoSt range of protein expression, lack of res- 
olution, post4rsnshtional modifications, staining behavior of 
the proteins, as weB as la the fact that many abundant pro- 
teins streak over ass abundant eompoacnU mteffcring with 
therneasufttnents. At present, fluorescent tcclKotoey seems to 
be way ahead; as with the fluorescence stain Sypro Ruby 
there b a Baser reapemsa with respeo to the sample amount 
over a wide range of abundance fJT7| aad references therein). 
Quantitative uHiOTCsceooe measurements can be performed 
wftb CCfVcamera based systems as well as with laser scanner 
systems Q77) and references there in> lo some cases, racoons- 
beJfini in combination with scintillation counting oflcrs a rea- 
sonable alternative for quantitaung a smaU number of pro- 
teins [68). 

3JJ. SdintificnttotL Methods of protein identification 
hive included unmunoblotting p8,79]> Ed man peptide se- 
quencing Q80.8I) and refcrcrioes therein), amino acid compo- 
sition (B2£3)» end more recently the use of matrb-ossisted 
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tuer daorptienfiofiiatioo (MALDJ) man spectrometry 184] 
and electrmpray ionization (ESI) |8S). The toner tcdmtquca, 
which rely onto* comparison of peptide mass fingerprinte, are 
fast aad require only picomel amounts of proteini. limited 
peptide sequencing can be performed rang tandem mass spec* 
trometry or posi-souiee decay, but h it la stfl) sot possible to 
make N» and OtenninaJ sequence identification. For a review 
en Ed roan peptide sequencing sec reference [81}. Andersen 
and Mann review protein identtfleattan by mass apuajuumry 
elsewhere in this issue. 

3.2. Making wmprthavitt 2D PAGE databmi 

Advances in bardVnre and software development (Elsie, 
OeAab, Metonte. Quest, Tyebo aad fCepter) for scanning 
and image analysis of 2D gels, aa wefl aa the development 
of sensitive toob for rapid protein identification, have cata* 
tyicd the establishment of craprtbansive proteomic 2D 
PAOE databases [64,85). These databases aha at toterfacing 
protein information with fonhonmtog DHA mapping aad se- 
quence data from genome projects* and offer a global ap- 
proach to the study of gene expression both in health aad 
disease (bttp^iobasexJkJcg>-bir^celij; htlp -J/upasy.bcugexh/ 
sprot/sproWop.btmD 0 10.55) aad references therein). Besides 
annotating genomes, these databases are expected to address 
problems that cannot be approached by DMA analysis, 
namely, relative abundance or the protein product, nost-tmns- 
bttonal modifications, subcellular tocaliation, tumour, in* 
teracuon with other proteins as well aa functional aspects. 
The first prototype databases were built by Gefis aad Bravo 
(87), who pioneered the use of protein identification tech" 
oiqucs to establish comptthcosivc 2D PAOE databases (88) 



Fig. 4 shows a. region of the syothetie master 2D PAOB 
image (tsoctectrfc foccsia& IEF) of | ,f S>swthiosine labelled 
protcias from fresh non-catered human keratmocytes as de- 
picted to the World Wide Web (htrp^obastxd^eg^hln/ 
eetis). FTotems ftajged with a red cross correspond I 
polypeptides that have beta idriiiilhtl by ease or a j 
ttoa of techniques that include (i) 2D gel 
using spwifle sntibadsai aad the ECL det ect i on prooedurc; 
<u) mtcrosequrttcing of Cbonmssto bruhan) Bhic stained pro- 
teins PJ9& (u) mass spectrometry of tryptsc peptides [90) aad 
flv) rrmrfgrarfoo with known human proteins (odividuaJ pro- 
tetos aad organelle compeatnti) and (v> overlay trrhntqirrr 
159) aad <W) transient opresstoa in marmnalian oslb f91). 

To date, 1237 polypeptides have been identified to this data- 
base (IEF and NEPHGE) of the 91 59 that have ben resolved 
and catalogued. In edmhoo to p'Shnelhionins UbeOed pro- 
tons, the daUbasn orataias a few polypeptides that tack mo- 
tetonine, but art revealed by eDver staining, Cbomaote br3- 
' bant BWe or by tabtOmg with a mixture of Id ("Qamroo 
odds. It should be stressed Chat some protcinj migrate both 
in IEF and NEPHOB gets (for example, exnolase, triose- 
phospbate isomcmse and elongation factor 2) and they may 
serve as budmarhs to align (he gcb and as references to oor- 
malixe quantitations in both pH directions. 

Informs tioa gathered on any given polypeptide, known or 
unknown, can be retrieved by choking on the wwipondmg 
spot, to this case 144-2 sigms, also known as stratifio (Ftg. 4) 
192). A file containing aD of the informadon entered for this 
particular protein, mostly obtained from experiments per- 
formed In our toboratoiy, to shown in Fig. 5 (only a fraction 
of the file is presented). Files for known proteins contain links 
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to a sublet of Medline (hup ://wvw.scbUua) jto.gov/ 
ftibMecV), Swiss-Froi (hup ://c^suy Juaigt^h/jprol/ipTOt-top. 
html) and PDB Cbttp^/v^jcmb^ftadclbtTf^dc^oOi Other 
boss include OMIM (htfp;//www rrrhi nhn nth grw/ Ointm/X 
OeaeCards (nttp://bipmforniatk».wcieniann ar Ufca/ds), UnJ- 
Gene <http^/www^biAb9.Dib40vAlaiOcat/utaAtad) nod 
other Web sites such «s CySPID (cytoskcfctsJ protein dale- 
base; bttp/fjparfla mrd ymle^^^painer/cytpAdh/lndmu 
btml). metabolic pathways (compOed by X£OQ: tX\pJJ 
www.fsigmcj^jcAcsgO* the cytokine explorer (hnp:# 
kbfli coii < fnitsouft^du:d43feytokiiMV^ptofgr.htiplX histology 
imtftJ (http://bi05un.bi obasc. dk/~ pdi/^iMJu/uiiuogfanto. 
buri), etc. Id the future, bj new databases aad related Web 
site become available, h wtl) be passible to navigate througb- 
out various databases containing complemcotary information 
(ix. nucleic acid aad protein sequence, g iaouK nappbp^ dis- 
eases, protein structure; poA-transtatsooal modi/lea lions, anuV 
bodies, rignalbng pathways, histology, etc). Ctearr/, data- 
bases allow easy access to a large body of data: once a 
proicifi is identified in given databases, aO of the information 
sonmmtsfrd can be easily retrieved and made avsibbte to the 
researcher. 

Today, 188 infonnatioa categories are available io the 
World Wide Web version of the keratinocyut database. These 
include ceBular localisation, pathways, proteins affected hi ' 
psoriatic keratinocytcs, proteins cspicsscd in normal urotbe- 
bum, kcreiinccyics, fibroblasts. He La otlb aad bladder caret- 
noma oeD line RT4, levels in fetal human tissues, partial ami- 
no acid sequences, abundance; cytoskclclal protons, calcium- 
binding proteins, snnrimi, chaperonins, heat shock protein », 
etc. 

Functions to query the databases include search by n&rric. 



protein number or keyword* (Ftft. 4), coolecutar weight and 
pi, as wed as by organelle or cellular compontnL By cbcainj 
on any ofihe organelles; celhibv ttruttores or vumpumu ta it 
b possible to get a protcb nil as weQ a* tbear relative potV 
uoas on the master gnaga Io addition, ons oaa retrieve a ust 
of all Ijiowb proteins recorded to the database, Moreover, we 
otooob and videos of preparative steps that can be 
I to reproduce the data dbpfryed as wefl as o gallery of 2D 



Aarhut 2D PAOB eauuautt available at http:nVobasc.dKS 
cgWnlccas lacfude tia aridon al cell eardnoanas (TCCa), squa- 
mous oeO CErctoonsas (BOCaX oriac, fibroblasts, and moase 
kidney ecus. Other databases available in the Internet can be 
found b bttpV/eapasy hmgr rffv^piot/spTOWtop.haad; htftp:/) 
www.narchrhl nlbSfwijha.nVBj^/^TotcD; and bttpj/vsKf 
paaexhaa^oJu-'baxBajde/^ pjsssaMhtbAtnsL Also; 
pfoteoavc tools for pro lei 
tioa, priniary structure analysis, i 
boa, tcraary structure anc? DNA ttmnsbuao into protein are 
avaibsbb at the EXPASY pioteo i nk server (hRp:/ravrwjcapa- 
rjjcMwwwftoott). 

J J. Applications 

To date, there have been thousands of reports ilhsstrabng 
the usefulness of the 2D PAOB and protaomsc technologies m 
many areas of biology. fJiimiiii of space limitations; however, 
only o lew of the application areas are highfighted betow: 

J.I.I. Qouxr. A great deal of research has been devoted 
to the cfuddation of the pathways that control cell protifer* 
alien in normal cells, and hence, the detcrtainauon of the 
nuans by which ahentisas of these pathways lead to abfior- 
oud growth charsxtertstlcs and/or a«oxdssac iraasforraauon 
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and caoerr. Most studies tun focused on otrcojroa, tumor 
suppressors, cell cycle regulated proteins and sign) transduc- 
tion molecules in various ceil types of different species (93-95), 
but only in a few cases inert turn bees systematic attempts to 
analyze the protein pbeootypc of pairs of conns), and bans* 
formed eel) rypes using a protcomJc approach (60.60,97-99). 
So far, only very few studio have made use- of biopsy material 
due to problems related to tot cdl baterogcaeity. 

Already in 1982. Celts and co-workers started a proteomic 
approach to the study of cell transformation uso| doncd cell 
. lints (68,97,9I|. Their resuhs showed that transform* ihra re- 
sulted in the abnormal eapreuion of oonnai genes, rather 
than in the expression of new ernes (98). Id addition, their 
studies raised a word of caution concerning the widespread 
use of protein information derived from studies of different 
ceil types from various species. Today, we are.wefJ aware that 
cultured celts* undergo important changes when placed Is cul- 
ture due to different environmental factors and growth con- 
ditions ||Q0) and accordingly, current efforts using the proton* 
rate approach are being directed to the study of raptured 
celb and/or tnsue biopsies. Among the cancer projects cur* 
rentiy underway, those centered on tsuaarmta and hematosog- 
ical tnalitveoexsv breast caaoer, cotorectat caaeer aad bladder 
cancer are briefly mentioned below, 

J. J././. Udtcmlo art hematologies maHgmcki Studies 
by Hanash and colleagues oa chMhocd leukesnia aad other 
haniiotogtcal rffflttgyianrfrs have yielded to far several 
marten thai include Oplg, abo known as an 
oncoprotein that has been impSemted tn signal bafisoactioa 
(101,1011. <o childhood leukemia, phosphorylation of Oplg 
was shown, to correlate with a high content of eeOi in the S- 
paasc suggesting a role in proUfcretioa. The group abo 
identified omB-HI (nucleoside diphosphate kinase A), a 19 
kDa protein that is upregulated m aormal lymphocytes 
treated with inhogens as wdl as in leutama eelb from 
patients whh acuta leukemia (103). 

J.J./.7. Brtatt cancer. Systematic atudjcs of dinicai breast 
tumors of different histopathologjcal types by Frances and 
co-workers (104- 106) have revealed several proteins, including 
FCNA, hspoO, hsp90 and calrctsculin that are UgWy deregu- 
lated in invasive easciaomas and thai may serve as prognostic 
Barkers. These studio have made uso of fresh ofiatcal rumor - 
tissues of different subtypes aad have paid special attention to 
sample preparation. 

J.J./.J. CotortcuJ ameer. Studies of Jungblut and col- 
leagues (107-109) oo sets of raaefosoopjeouy aormal colon 
mucosa and colorectal carcinomas have revealed several pro- 
teins that are deregulated in the tumors. Dcnmregubtad pro- 
teins included the liver fatty add hading protcss, the smooth 
muscle protein 23-cs, aad c^oosygeease 1 Ucrcpriatcd pro- 
teins taduded the heat lhsek protein 70 as well as several 
members of the S~1Q9 family of cakrao- binding proteins (S- 
100A9, S-100 Ag, SI00AI1 and $.|00A6>. Seme of these tmd- 
iags have beco confirmed by uamunohistoshemioal studia 

3 3.1. 4. Bladder cancer. Celb and colleagues (110-112) 
hive explored the possibility of using proteose expression 
profiles of bladder rumors as fingerprints to subclsssify hiito- 
paihoiogieBl types, and as a starting point for searching for 
protcia markers that may form the basis tor diagnos is, prog- 
nosis and treatment. To achieve these goals they have ana- 
lysed the proteome expression profiles of hundreds of fresh 



tumors as waD as random biopsies aad cystectomies |l 1 0-1 12], 
aad have established TCC and SCC proteomic databases thai 
may provide a solid Wrastructwe to support future studies 
Ql 13); bttp^/biobasc^Wc^bin/ceiis). In the long run. a prac- 
tical goal of these studies b to identify a eompbte set of 
protein btoatartcrs that may be useful to classify histopatho- 
logjcal grades, and that will provide with spaoifie probes for 
the objective diagnoses, p ro g no sis aad treatrneot of these b> 
siont So far, these studies have revealed markers for TCC 
progression (1 101 a marker m the urine of patients beating 
SCCS 1 114.1 IS), and have led to the d e vd opmc at of n novel 
strategy for testifying prtmajignam squamous lesions (112). 
The approach snakes use first of proteomic technologies to 
reveal and identify proteins' that arc differentially aproned 
m pure SOCa aad normal urotbetium. Thereafter, specific 
antibocQcs against the differentially expressed proteins arc 
used to tamunostein aerial oryostat sactfoos of biopsies (Im- 
aniaowalkng) obtained from SOC patient* thai have under- 
anas removal of the- bladder due b> Invasive disease (cystec- 
tomy). Since bladder cancer is a field disease (lid) - that is 
targe pan of lac bladder lining is at risk of drvetoping disease 
- h b cspected thai the nrathefium of these patient* may 
cshibit a spectrum of absormafitica ranging from metaplasia 
to mvasive disease 

Xli. titan dtscarcs. Heart failure b among the leading 
causes of eaortsJirym the Wo aad therefore, 

effort* are being devoted to the elucsaatloo of the -»»*--»ir 
events seeding to cardiac disjunction flll7) and rcfeitoces 
tacreis). So far, research on dilated cajdmesyopntby (DCM) 
(117*119), has revealed that spproximatety 100 proteias arc 
deregulated; inostty downitfipuntcdl in DCM as compared to 
their norma) coiaterpasts. These indude cyiosicrctal and my 
ofibrihar proteins, pc^ypepttdm assodatod wish tmtodroadria 
aad involved in energy pr od uction , as well as proteins asso- 
datod whh the stress rcaponse, Teas studies have beco expe- 
dited by ue nfaWi'iharat of proteomic 2D FAOE daubaacs 
of the humaa heart (veotricie aad atrium) ( 120] 

In addition to the global aaaJychi of protein expression 
patteros in humaa heart cuacesca, cardiac antigen opreasion 
following cardiac tra ns ph ustatto a baa also bona studied using 
twto rlq uc a from profaomkt For example, usmg 2D PAOB 
(cardiac proteins) in cornhfemmn with Western bnmujiohk*- 
tlag (patient sera), h has beta possible to identify antigens 
that react with autoantibodies present both in DCM 
1121,122] and tnyoesrdhb (121). In this way, antigens associ- 
ated with the antibody response that may be involved in acute 
or chronic organ rejection have been characterised. 

Proteomic studies using aainsal models of heart disease 
have also been carried out in aa-effort to unravel the mota> 
ular events leading to cardiac disease. Recently, two different 
studies involving tasgje suumab, psnofodtaad heart failure in 
the dog (124) and bovine DCM (125), yuOded tinnier reeutls as 
those observed in humaa DCM. Interestingly, the most strik- 
ing result in the bovine DCM study was the finding of a 
seven-fold decrease in the capitation level of ubiqiritin C- ter- 
minal hydrolase, as inappropriate ubiquination of proteins 
ho bean suggested as an ctiologk factor in heart failure (126). 

I.J.J, Toxteototy. Changes m the environment as well as 
the growing interest of the pharmaceutical rodustry have 
stimulated the devetopment of novel testing approaches based 
oa the recent technical advances both in genomics and pro- 
teomics. Pioneering studies by Anderson and Anderson (127) 
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have highlighted Che uwfutrjca and potential of the proieomie 
approach to identify qu&mjtativc changes in rat flvcr apics- 
aioo profilca unnamed with toxicity of drug* aad other scno- 
biou'es. The data, which ait ban* eystenutieally stored In the 
rodent molecular effects database, an expected to yield Im- 
portant iaformatioo aa to the coolccul&r ra^ehanisma usder- 
lying toxic responses. Likewise, the potential 'of pretocrmra 
have been recently exemplified in studies of glomerular neph- 
rotoxicity ia rata {1 28], and of stimulated occupational jet Aid 
exposure ta mice lung |J29). hi particular, the studies ofSteb- 
er*a group have shown a remarkable csmfatloa between de- 
creased levels of ralblndio D-28, urinary calcium wasting la 
the urine, and iotra tubular corticomcduDary calcifications in 
the kidocy of rata and human treated with cydofporin A 

In the future, proicomici b a high- throughput mode U 
cftpccied to have a major impact in the pn>diakal aafety 
testing of drugs. These Undies will be facilitated by the crtab- 
lishmcoi of 2D PAGE databases of frequent target tissues 
(kidney, liver) as well as of cell lines and (raids. 

J J. 4. hf euro logical disorder*. The Cfeutzfctd-Jaeob dis- 
ease (CJD) has been the subject of intensive analysis ustag 
proteomics. These studies have led to the Identification of 



two oscmben of the 14-3-9 fasmry of proteins fas Che cerebral 
spinal fiaid (CSF) of OD patscnta f'l31|. The presence of these 
proteins in the GBP baa bees used to differentiate CJD from 
other dementia both with high sensitivity and specificity 
(132,133). These proteins, however, are present in the CSF 
of patients suffering iron other neurological disorders not 
revolving dementia, limiting rts clinical value (134,135). 

3.4. Coxtbaien* 

. Today, there is no tectmotegy fa sight that mutches the 
resolvtog power of 2D PAGE, a oxhniquc that will continue 
to enjoy a central position in proteamie projects for some time 
in the near future. There is considerable room for improve- 
menu however, in particular as for as sample preparation and 
Kotubilicy, choice of pH gradient and detection methods are 
concerned. Alio, wc need to improve the separation of very 
basic as well as very tow and high molecular weight polypep- 
tides, bi general, one a poets researchers to first use wide IPG 
gradient* to obtain an overview of the protcome profiles, and 
then proceed with a mora demited analysts using narrow pH 
gradients, wfeieh provide higher resolution and sample load- 
ing, thus increasing the possibility of visualizing the lesser 
abundant proteins. The latter can be facilitated by the avail- 
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ability of specific aatibodies , as well as by the us? of catrac- 
lioo procedures and subcellular fractionation methods cur- 
rcnUy at hand 1136-138). " 

There arc Kill many additional chaltrogcs. however, that 
must be addressed bcTore a complete Human Prctcomc Prej- 
cci can be implemented (I39J. These intrude: automation to 
allow high-throughput sample analysis |I40^ improved quan- 
titation capabilities, better instrumentation and software for 
peptide sequential using man spectrometry, more sophisti- 
cated onage analysis systems to support gel omnpariseas 
and daubasing as wtO as improved btoioforraattc capabilities 
overall |I2). In addition, we need to deal with the problem of 
tissue ccD heterogeneity as more and mom proieomie projects 
will make use of biopsy rnatcrial in the future. 

4. Transcript and protein bvcts: DNA nueroarreys and 
pmteomlcs applied to the same samples 

As mentioned in Section I, both DNA cnlcsoamys and 
proteomim ait complementary tecnnoiogfes. To date, how- 
ever, there have been only a united number of studies in 
-which both technologies have been compared by. applying 
them to the same sample (141,142). Notably, the pioneer stud* 
cea of Anderson tod Scflbamer |52) showed that there is not a 
good correlation between mRNA and protein levels in human 
liver, implying that gcoe-bascd expression oats may be of 
limited vi roe in the process of drag cuscovery. The study, 
which compared the levels of 19 gene products, yielded a 
correlation coefficient of 0.48 between mRNA end protein 
abundance, a value that b half way between perfect and no 
coTTcbitioa. 

Recently, 0rntoA ct sd. (manuscript to preparation) carried 
out a oseroarray and pr oteo mic study of Madder cancer In 
which they compared the transcript and protein expression 
revels of pain of ooa-invasiv» and invasive low grade fresh 
TCCs. Bven though they could only compare the levels of 
about 40 wcJKicsntved and focused abundant proteins, it 
was clear that fai most cases there was a good correlation 
between transcript and protein levels. Only in a few cams 
.they found doorpaocto, and in some of thorn instances* 
they could not eliminate the possibility thai this was due to 
messeDger stability, post-transcriptiooal spitting, pcm-eransla* 
Uonsl modincatrons, protein focusing problems, degradation, 
as well as the choice of methods used to assess protein ca* 
predion levels (staining versus ndioubelling). For example, 
in odc tumor pair they found that the levels of keratin 8 
transcripts were much higher in the invasive tumor (compare 
Fig. 6C and O), wmle the protein leveb were much lower 
(compare Fig. 6A and B). Immunoblottlng analysis using her* 
atin 8 specific antibodies revealed that the discrepancy was 
due to degradation, as several related products of lower ap- 
parent molecular weights and more acidic pis, could be visu- 
alized (Fig. 6E). The identity of one of the crossreactiag 
peptides (radicated with arrows in Fig. 6£) to he rata 8 
was further confirmed by MALD^ome*«rdjght (TOF) (Fig. 
6F). From these studies it was dear that when comparing 
mRNA and protein levels there are other factors that 
ooed to be taken into consideration when intcrprctiog the 
data. 

When comparing transcripts and protein expression profiles 
of matched sample pairs one often gets the impression that 
there ore more changes in the abundance of the mRNA (ran-- 



scripts as compared to the proteins. Considering that the cur* 
rest 2D PAGE technology depicts mainly the more abundant 
proteins, it would seen possible thai most of the changes 
enacting protein levels may involve low abundance porjrpcp- 
Udes. 

S. Cent expression profiisg techniques: perspectives 

Novel and powerful techniques arc now available to analyze 
the global gene expression patterns of cultured ceils and tis- 
sues obtained from QormsJ and debased subjects. Each of 
these technologies ha* its own advantages and limitations, 
out in combination they should provide us with a detailed 
gene aprcsxfon etoorype at both the transection* and 
translation level. A major challenge us the near future will 
he to define a base use for lbs normal gene expression pho- 
ne type of a given cat) *n*im oibody toM. lib i» *o\ * 
trivial task, however, as tt will require the analysis of hun- 
dreds or even Uiousamti of samples. 

Besides impi ovcmtBtj on the rodi vidua! tecmsisjues them* 
serves (sec above), there am etui major Imitations that must 
be sddrcesed before these technologies can provide the 
expected ourxome in amlecuUr median*. These mcfude: 
(0 tcchnienJ emlicmj associated with the analysis of exprcs- 
sipo profiles derived from tissues that are coosposed of duler- 
cot edl rypes, (5) tick of |u oe edui e s for Identifying targets 
chat lie in the pathway of disease, and (B) eand for bwiufor- ' 
mattes toob for rapidly assessing the function of the putative 
targets. The latter, is of paramount teportance to the phar- 
maceutical wdustry as the ttatfficaticm of disease deregutatod 
targets alone is not lufiUest to start a oostly drug screctimg 

As far as tissue hetCTos^ncity if concerned, the recent ad- 
vent of LCM holds great promise as with this technique it fa) 
possible to isolate specific populations of ccfis from a tissue 
section using deraet m iu o s co p ic observation pQ. However, 
even though the technique has been used for UNA analysis 
it bttiO not ready for enost ptetxeemc projects as the number 
of cells that cam be obtained b too small to sxaerate reason* * 
ante ptoteta profiles lb terms of the number of pse4eins that 
can be visualized ft4J). 

As DNA microarraya and proteonshx* ge ne rate more data In 
thsfotumitvrillbcm 

and rapid strategics to vaKdete the. van amount of informa- 
tion, that win be gcoenteo\ particularly in tissue biopsies. Ibis 
wc believe can be cccompHibvd fas past by snaking use of 
specific antibodies m combination wiuVhsmsnnc^trtochcmistry 
(1.121 At present, them is no technology at band that may 
allow us to prepare antibocfim at will, although phage anti- . 
body Kbraria |I4) show much promise, 

Finally, we would Hke to cmeuussise that biology In this 
Muleanium wiO be chaiactericcd by the study of oomptca 
biological phenomena. DNA ntscroartoys and protcomJcs 
are just some of the technologies of functional genomics, 
and only their integration nay a Dow us to tackle the great 
complexity underlying biological processes. 
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MOLECULAR ANALYSIS OP CANCER USING DNA AND PROTEIN 

MICROARRAYS 

Jam Meate-Guipido end Sea M. Himib* 

DfTBODUCfTQN 



>. has beca nude to our un derstandi ng of cancer a • muttitej* 
'disease thai involves progressive cheaps in inn pnoma end rafeeme. Vnrinua 
types cJcanem share rimttentitt^ 

molecular omiia. Mkroarray tenodeses have the potential of preWidhig vetoabte insight 
rapv«qg disease processes. The amy foimat b now en established mtthod for gtotn) 
analysis of nucleic aefda, ond »n the past few years Ada eppraech has bee 
protein snnte (Table IX Mtemunvj allow profiling of toroors* genomes, i 
and praeonjea at e Kate onoRalsobas pjvvieasr/. 
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GANCSft PROS1UKC USING DMA ftUCROAJUUYS 



studies! especially profiling gene expression, using DNA amy have bed e 

OYtnenooBs Impact on Momedtcai research, resulting in well over 1.000 published repona 
In 2002 alone. A substantial number of published ftodies dealt with cancer. Vistas* related 
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eppllcau'ons of DNA mieroarrays include uncovering unraspccted associations between 
genes eod specific dlnicaJ features of disease that are helping devise novel moleculsr based 
disease classifications. Most published tumor studies using DKA microBnoys have either 
examined a pathologically homogeneous set of tumors to Identify clinically relevant 
subt>pes* for example survivors vs non-survivors, or pathologically distinct subtypes 
belonging to the same Imeagc. for example limited stage vs advanced stage tumors to 
identify molecular correlates, or rumors of different lineages to identify rooJcceJer 
signatures for each lineage. 

One of the landmark studies that have attracted touch interest with respect to the 
potential contribution of DMA nucroerrays to uncover novel classes of femora, is en 
analysis of diffuse large tVcell lymphoma, the coast common subtype of fton-Hodgkioi 
lymphoma . Large B-ceJl lymphoma is • cUoicalry rattrogenccsts disease. Only 40ft of 
patients have a good response to current therapy with a proSoogecl survival. A systez&iSc 
characterisation of gene expression in this disease using DNA au q uai i oya uncovered a 
■ diversity in gone expression chat teflected variation in tumor proliferation nil; host 
response and differentiation slate of the rumor. Tw© roufecularly distinct forms of cfiffese 
targe 8-ccil lywuliuiaa were mmwuuJ which had gene nonunion patterns imlicailve of 
cfarTereol atagca of B-cctl dnlerentJatjoew One type ex pressed acnes characteristic of 
germinal center B ceBs and bad a significantly better overall survival than the eecoad type, 
which ca pressed genes oercnally induced during to view activation of peripheral blood B 
cells. The analysis cherefcre identified previously undetected and critically signutam 
subtypes of lympboms, 

Stwlies to classify Ines^cjtroo^ 
tomon could be classified Into a basal epithelial like group, an £RBB2-cmrenre*suia 
group and e nortsal breast-fake gronp* 1 w . A tumina) epiiteual/eatrogan recmptv-poaltrve 
group could be divided into at least two oubgmopa, each with o distinctive expression 
pcofda. Survival analyaes on a euboohort of patients with icjcsty'edVaoced breast cancer 
unJfwotiy treated, in a prospective study, showed tiasJfteamly diffcrcnl outeemes for 
patients belonging to the various groups, including a poor pmanosla for the basal- tike 
subtype and. a significant difference m outcome for the two estrogen receptor-positive 
groups. In on iodepeneeni study of 39 invasive breast canccxa. striking smncctder 
differences between ductal carcinoma specimen* were uncovered that lad to a satiated 
new classification for esoogen-receptor negative breast cancer". Similarly, a stody of 58 
nedfroegative breast caicinosnas discordant for BR atatua also uncovered a list of genes 
which discriminated tumors seending co EJt suma™. Ardnclal neural networks could 
accurately predict Est suuus even after excluding top discriminator genes; Including BR 
itself. Only a email proportion of the 100 most important ER discrfanhtamr fence ens 
regulated by estradiol In MCF-7 cells. 

An Informative approach to analyze DNA micToarray data in clinical absence is to divide 
such data into a matting set to uncover associations between specific genes and certain 
clinical features of the disease, and ■ testing set to vslidate these aasodationa. However 
smee both omitting and testing sets are derived from the same pooJ of 
samples were available to the.ijrveatlg^tom the extent to which such 
apply to other patients not included In the study, who may have different < 
cannot be inferred. Beer et al. have undertaken a study of long cancer in which the 
association they observed between a set of genea and patient survival wis validated with o 
testing set of rumors tftey hod available and further validated with an independent set of 
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tumors for which microarray data was collected by another group no: associated with tftat 
itoi/ 9 . Such extensive validation clearly indicated the robustness of the associadoo 
uncovered betweco o set of genes and survival in lung edcmxerciiioma. 

The Dtrnicms published no dies using DNA microarrays justify the use of Iha 
technology for uncovering patterns of - gene expression chat are clinically informative, 
However fit Is sobstondally rnore difficult to develop an uodentawfcng of disease at o 
rotchanistic revel using DNA uuuoanays. For coast of the published studies it is unclear 
how well UNA levels reported corretae with protein levels. A lech of correlation may 
toply that she predicdve property of the gcne(c) Is tndepestfest of gene function. In stodia 
of lung cancer, Chen et al. coDected both DNA microarray end 2*D PAGE dan, which 
allowed them to compare mftNA and protein levels in the seme tumors 10 . The integrated 
mtrnsities of 1 45 protein spate representing protein produces of 98 genes were analysed la 
76 lung adertocercrnernas and 9 uruuTccted lung tissues osing 3-D sets. Far the same 85 
samples. mRNA levels were determined using oUs^rmdcotido mluuanuyt. Oaly 21 of 98 
genes C21.4%> had a stsristicalry significant correlation betwecv .protein and mRNA levels 
0 >0.2445; F <0J>3). The insuNAr>reteio rarrcUtion eoefrtcierrt* also varied between 
isefosms of the same protein, indicating potentially isoforn>spcciric mfrhimhrna for the 
regulation of protein abvndaitcev 



PROTEIN MICBtOAWUVS IN CAJf CER RESEARCH 

Despite the advances In our ortdsrsuuiding of the molecular basis of t 
gaps retrain bosh Hi our understanding of cancer pathogenesis and In the < 
effective strsargjot for early diagnosis and for u ratine bl a pre 
Mvcstfgatmj diseases such as cancer 1 easy ovemiM aorne of the Uratatfofts of < 
approaches'. DMA m luuaiiaya have limited utility for the analysis of biologleal floidf i 
for oncoming directly, in the fluid, ssssysbte btomarters, Nernrrous sltemunns may < 
m protases that ore not reflected rn chsnges at the RNA level. 

Unlike DNA nucsoarrays that provide one measure of gene aprcaslon, tamely RNA 
levels, there is s need to tmplesnent protein jrncfeairay strategies thai address the many 
difTcreni features of proteins including dcterrmnatien of their levols in rjJoiogJcaJ samples, 
and alternation of their selective interactions with other proteins, antibodies, drags or 
various small ligansts. Arrays that irjcorpcrate antibudkr n ' 1> or rcc omMnam proteins 
obtained osing eDNA expression ltbnvteV 041 or phaje^lsplay libraries 4 ' have been 
utilized for different types of protein based assays. With other types of nriaoarreya, whole 
tissoe»derived samples neve been direcdy arrayed onto slides, to assess the reactivity of 
total protein lysatea with specific Kgand**" 4 ^. two practical applications of protein 
mtcroanays were presented by sTief i rtrh* *, designated protein function array and pfotehv 
detecting array. With protein function arrays, a large amount of protein is spotted on a solid 
support at a defined location and tested to characterise cither a biochemical activity or a 
molecular Interaction. The pretdn-desecting array consists of an arrayed set of protein 
ligands used to profile gene expression end draw signatures mdtcative of the cellular state. 
The whole process of assembling tfie protein array requires considerations related n> the 
nature of the support the type of trnrnobUization, as well as das molecular architecture of 
die particle being a Bached. Four main different supports have been optimized to ucifoiin 
assays of this megnrtude: chemically modified glass riides (poryrC^ysioes, polyaloeaides, 
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these samples. Patterns that distinguish between cancer patients ami normal subjects with 
remarkable attorney, have been reported for several types of eraser 1 . The coupling of 
protein arrays with moss spectrometry technologies is likely to become a powerful analytic 
tool with which to profile protein ejrpraifon. Such en approach, fcaown as AottreQup 
(Gphe/gen Inc. USA), was successfully applied to snujy prostate and ovarfsn cancers* 
sod, more recently, head and neck cancers" 9 . Rcsultt from these studies revealed the 
involvement of proteins in carcinogenesis processes and specific airy identified protein 
fingerprints from which cancer biomsrktn were eatraeittL The major drewbocks of direct 
analysis of tissues or biological fluids by MAU>! is ths prefemnticJ detection of proteins 
whh a lower molecular mass and the difnetriry m Identifying, the proctitis coiiajponding to 
the masses observed. Aether technological improvements could enhance the ntiHiy of 
direct mass spex trams trie analysis of tissues and bieJogpcal fluids* 
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Cancer- tissue profiling studies thai hsve utiGsed protein nrftTOs nays are beginning so 
emerge' As a model to better understand how pasterns of protein expression shape the tissue 
luiuouivtmnmestj Khexevtc ct a), analysed protein expression in tissue derived from 
squamous cell carcinomas of the oral cavity through en antibody luiiro&iiey approach for 
Mg9Mhrooghput preuxunic analysis . Utilising laser capture microdissection to procure 
total . protein from specific microscopic ceOular eae^uiations, they dcmonsMed that 
quantitasivav and potentially qualitative, dUtcrcsscee In e,apir sal o n patterns of omhipto 
proteins within cpithcHs] cells leproducisiy correlated with oral cavity tumor progression. 
DuTereadal expression of multiple proteins was found m stomal cells surrounding end 
adjacent to regions of diseased epiiheHum that otnectly ccmleted with tumor progression of 
the epithelium. Most of the proteins identified m both odl types were Imolwd in signal 
transduction pathways* They hypooSssBed therefore that 
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comirtjnicaiiora involving complex cellular signaling between epithelium find stroma prey 
• key rAajojbvvIng oral cavity cancer progression. 

A clinically relevant application or protein mtcroarreys is the identification or proteins 
(hat induct an antibody response in autoimmune disorders 0 . Mkroarrays were produced 
by attaching several hunctad proteins and peptides to the surface ofderiveurxd glass stales. 
Arrays were incubated with patient serum, and fluorescent labels were used to detect 
autoantibody binding to specific proteins in svtotamtme diseases* including systemic lupus 
erythematosus end rheumatoid arthritis. Such nocramyi represent a powerful tool to 
study immune responses*. io * variety of diseases including cancer. 

A reverse phase protein array approach that Immobilize, the whole repertoire of a 
tissue'! protesiu has been developed*. A high degree ofsenativiry. precision and tinearirr 
was achieved, making It possible to quantify the phojphorytsled status of signal proteins in 
human dssue ecD suhpopolations. Using this approach Paweletret al "have le^tudhmOy 
analyeed the stole of pro*survivaJ checapdm proteins at the tticroseopie Wanuoo Stan 
from patient tnttched histcdogicalty normal prostate eptlhelimn to prostate inowfthzHsJ 
oeophuiaand to invasive prostate cancer. Cancer profession vu associated wim T ariff 
phosphorylation of Ala. e uppr e as ion of apoptosis pathways, as wall as decreased 
phosphorylation of ERJC At the transition from litstolegSeaJly ftormaf epitheiltmi to 
Intraepithelial neoplasia, o statistically significant surge m phospnorytsted Aks and a 
concomitant suppression of downstneam apoptosis pathways preceding the tauition farm 
invasive oociooma wore observed. 

A major challenge In snaking bfocMps for global anarysla of protein expression is the 
current lack of comprehensive sets of genome scale capture agents such aa antibodies, Aa a 
result, biochips that target specific classes of proteins such aa sonatas or cytokines are much 
easier to produce, that would have clinical utility. Another Important consideration to 
protein oiiuuaiiays is diet proteins undergo numerous pcrttrsrudadonel njodUftcadons eg 
phosphorylations, glycosylate*, which are highly ioiportsni to ihdr functions, as they can 
determine activity, stability, lombraTion and turnover. To eddrcsa the need lor 
comprehensive analysis of proteins In their modified forma, several approaches to the liqoid 
based eeparatlon of ceil and dssue fyaaxes were investigated in order to obtain protein 
racoons with reduced coomleaiiy or pure Individual proteW 1 . Tna separation product 
can be arrayed in o manner that allows the probing of protein constituents of celii and 
tissue* u> uncover specific targets. Fc* example, using a combiiiation of anion ojumansc 
and n verse phase tX. Madna-Gurpide et aL have otaamcd soma 2000 rodJvidual prooftn 
fractions that have been utilised to produce mkroarrays that interrogate cancer cell 
prousemes. fractions thai read with speculc probes arc within the reach of • 
chromatographic and gel based separation techniques for resolving their individual protein 
constituents and of mass spectromctrie techniques for ideritification of their constirecnt 
proteins. The IC procedures allow surTtcierit protein amounts to be resolved far the 
construction of large numbers of micmarnyt from a given cell or tissue source. 

Protein microarraya of afferent types are likely to become canunercfally ovaJIsbla for 
assays of broad sets of proteins and may well rival or at least complement DWa 
micro troys ts tools far global expression analysis. 
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SUMMARY 

In conclusion, array-based technologies Dove emerged mat caaoiboie to profiling 
tissues at die genomic enoscripiomic and protcomic levels. Analytical cools an* needed to 
mine the vast amount of data generate* Ultimately the molecular analysis of cancer at a 
genome aid preteome stole will allow better classification of disease and tailored 
indiviAuditecltben^ 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified Tour putative onco- 
genes thai could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell lines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in tumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify 1 83 genes with increases in genomic copy 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
cell lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S). 
Using random permutation tests across all cancer cell 
lines, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over -r eprese n ted i n th e genome (Table 1 ), suggesting 



that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
linking genomic changes with altered transcript patterns 
in breast cancer fHyman el al., 2002; Pollack et a/.. 
2002). However, our finding that only 31 % of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et al., 2002; Pollack 
et al., 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 

Proteomic analyses identify four genes for which protein 
ahun<kmce was associated with increases in the cognate 
gene and transcript levels 

' Analysis of transcript patterns is insufficient for under- 
standing the expression of protein products and the 

eifcet of genomic O'.cr-rcprcseiuation on the expression 



of their cognate proteins. To extend these findings 
beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell lines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9.5%) of those 
42 genes - PRDXl, EEFIA2, CALR, and KClP-l - 
showed statistically significant correlations between 
elevated protein expression and increases in both , copy 
number and mRNA expression (all r>0.84; /><0.05) 
(Table 2) in the cancer cell lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression ofPRDXI, EEFIA2, CALR, and KClP-l 
in lung cancer cell lines 

To confirm our findings from the high-throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell lines. For compar- 
ison, we arbitrarily chose one gene, NFKBI, in which an 
increase in protein level did not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure I). For example, KCIP-1 showed 
fivefold amplification in five of the six cancer cell lines. 
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whereas NFKBI showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKBI transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDXI as 
compared with normal cells. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer cell lines that 
showed at least fourfold increases in EEFIA2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as well (relative to 
their expression in normal cells) (Supplementary Figure 
3S). The protein abundance of the four genes showing 
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Table I Lisi of 183 genes' with statistically significant correlation 
(0.05) between genomic copy number and transcript level 



Table I {continued) 
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AKRIAI 
PRDXI 
UQCRH 
RPL7 
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PSMB4 
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RPS27 
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CRABP2 
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CMITI 
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RPL3I 
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RPL37A 
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SI OOP 
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SPPI 
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FO G 



SDHA 

PDCD6 
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RPL37 
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SOX4 

HDGF 

RPSIO 
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IG.FBP3 
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) COLIA2 
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2 
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2 
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3 
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5 
5 
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5 

6 

6 

6 

6 

6 

6 

7 

7 

7 

7 

7 

7 

7 

7 
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8.5 
20.1 
26.4 
32.2 
45.4 
45.4 
46.2 
96.4 
102.6 
147.3 
148.1 
150.7 
150.7 
151 
151.9 
153.1 
153.4 
159.3 
165.8 
199.7 
200.2 
9.6 
10.60 
101.20 
1 10.40 
191.80 
198.30 
198.30 
217.30 
217.50 
3.30 
65.30 
89.00 
46.3 
151 
161.5 
6.7 
39.3 
41.1 
89.3 
154.7 
156 
is* 



0.251 
0.305 

10.3 

14.2 

40.8 

74 

132.2 

147.2 

179.2 

21.7 
. 22.6 

34.6 

35.4 

43.7 • 

45.4 
5.3 

24.9 

25.9 

45.7 

72.7 

75.5 

92.7 

93.6 

98.7 
133.6 



0,0085 
0.01 1 1 
0.0113 
0.0114 
0.0128 
0.0122 
0.0125 
0.0127 
0.0129 
0.0222 
0.0131 
0.0134 
0.0135 
0.0266 
0.0143 
0.0167 
0.0148 
0.0152 
0.0234 
0.0154 
0.0165 
0.0159 
0.0119 
0.0161 
0.0169 
0.0175 
0.0277 
0.0185 
0.0388 
0.0189 
0.0193 
0.0204 
0.0285 
0.0455 
0.0207 
0.0211 
0.1122 
0.0215 
0.0222 
0.0227 
0.0231 
0.0235 
-0-0441 



0.0243 

0.0245 

0.0446 

0.0248 

0.0251 

0.0336 

0.0466 

0.0256 

0.0263 

0.0321 

0.0362 

O.0I77 

0.0369 

0.0372 

0.0173 

0.0378 

0.0381 

0.0289 

0.0389 

0.0403 

0.0433 

0.0425 

0.0457 

0.0475 
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Gene symbol Chro. Distance from p arm of each 
chromosome (Mb) 
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THBS2 
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SPINTI 

RPLI7 

PKM2 

IDH2 

RPL23A 
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UBE2I 

RPS2 

CLDN9 

ARL6IP 

EIF3S8 

TUFM 
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GPR56 
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APRT 

GALNS 

RPLI3 
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0.01 19 


8 


144.4 


0.0121 


8 


144.5 


0.0122 


8 


145.6 


0.0128 


9 


II7.I 


0.0136 


9 


121.1 


0.0133 ' 


9 


121.5 


0.0135 


9 


124.4 


0 0137 


9 


133.4 


0.0139 


10 


3.2 


0.0223 


10 


5.1 


0.0146 


10 


75.6 
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0.0215 
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0.0223 
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0.0435 
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0.0235 
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Table I {continued) 



(J cur symhot 


Chro. 


Distance from p arm of each 
chromosome (Mb) 


a 


F.RBB2 


17 


35.11 


0.0483 


JUP 


17 


39.8 


0.0495 


c:rf ; 


17 


40.39 


0.0505 


RPL27 


17 


41 1 


0.0046 


NMEI 


17 


46.59 


0.0082 


COMA'I 


17 


48.6 


0.0108 


ABCO 


17 


49.1 


0.0326 


NMK2 


17 


49.6 


0.01 II 


RPI..18 


17 


72.7 


0.0117 


SMT-JK* 


17 


73.6 


0.01 19 


SYNGR2 


17 


76.6 


0.0122 


LGALS3HP- 


17 


77.4 


0.0127 


P4HB 


17 


80.3 


0.0126 


PPAP2C 


19 


0.221 


0.0228 


OPI 


19 


39.55 


0.0145 


HPN 


19 


40.2 


0.0129 


2NF146 


19 


41.4 


0.0131 


SPINT2 


19 


43.4 


0.0238 


PSMOX 


19 


43.5 


0.0132 


YIFIP 


19 


43.5 


0.0135 


R PS 1 6 


19 


44.6 


0.0144 


CKACAM5 


19 


46.9 


0:0145 


CEACAM6 


19 


46.9 


0.0143 


GIPR 


19 


50.8 


0.0259 


SNRPD2 


19 


50.9 


0.0413 


KDELRI 


19 


53.6 


0.0152 


RPL28 


19 


60.6 


0.0156 


RPS5 


19 


63.6 


0.0267 


TRIM2X 


19 


63.7 


0.0158 


DAP 


20 


35.6 


0.0166 


TOPI 


20 


40.3 


0.0172 


UBC2C 


20 


45.1 


A Al HA 

U.UI /** 


RPS2I 


20 


61.6 


0.0268 


EF.FI.A2 


20 


62.8 


0 Ot85 


Tf-F.l 


21 


42.6 


0.0186 


TI-TI 


21 


42.7 


0.0192 


CSTB 


21 


44.1 


0.0201 


MIF 


22 


22.6 


0.0202 


XI1PI 


22 


27.5 


0.0204 


PRDX4 


X 


22.9 


0.0198 


SYNI 


X 


46.3 


0.0204 


TIMPI 


X 


46.3 


0.0209 


PLP2 


X 


47.8 


0.0212 


MAfiFDI 


X 


sn \ 


0.0331 


RPS4X 


X 


71 


0.0124 


SSR4 


X 


152.6 


0.0232 



(Supplementary Figure 4S). In contrast, EEF1A2 and 
KCIP-I protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 15-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for the disease. 



Amplification and protein expression of KCIP-I and 
EEFIA2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemicai analysis, in parallel, to commercially 
available human lung tissue microarrays (Ambion, 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDXI or CALR were 
not suitable for use in immunohistochemicai analysis 
when this report was written.) Overexpression of KCIP- 
1 and EEFIA2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(/> = 0.0003 for KCIP-1 and P = 0.0011 for EEFIA2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCIP-I, and 
strong cytoplasmic staining for KCIP-I protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by analysing Ki-67 
expre ss ion in contiguous flections of the tissue micro - 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibition of EEFIA2 and KCIP-I expression by 
siRNAs led to decreased cell proliferation and induction of 
apoptosis 

To further prove the oncogenic function of the identified 
genes in lung tumorigenesis, we used siRNAs to inhibit 
the endogenous expression of EEF1A2 and KCJP-1 
^protein in Tour lung cancer cell lines (HI 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEFIA2 and 
KCIP-I protein by 70-90% 48 h after transfection 



arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-I (P = 0.02) and 
EEFIA2 (/> = 0.0I). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage I rion-small-cell lung cancer who had 
undergone curative surgery (Wang et at., 2005). 
Immunohistochemicai analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-I in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEFIA2 and KCIP-I expression and clinico- 
pathologic variables. Expression of EEFI A2 or KCIP-I 
was associated with short overall survival time 
(/> = 0.OOI2 for EEF1A2 and P = 0.0026 for KCIP-I) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Table 2 



Arc. no. 



Proiems showing significant ovcrcxpression in cancer cell lines relative to those in normal bronchial epithelial cell lines and their 
correlation coefficients with increased DNA copy number or mRNA values" 
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4790 
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KRTI7 
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IGFBP-3 
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HP 
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Eukaryotic translation elongation factor I gamma 
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Heat shock 27 k Da protein I 
Phosphoglyccrate kinase I 
Transgclin 

Hemopoietic cell kinase 
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Huntington-associatcd protein I 

Keratin 18 

Keratin 8 

Haptoglobin 
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S 100 calcium-binding protein A2 

Zinc-finger and BTB domain containing 17 

Transformed 3T3 cell double minute 2 

Microti! bulc-associated protein 4 
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Hemopoietic cell kinase 

Matrix metal lo proteinase 7 

Cell division cycle 25 B 

Cell division cycle 42 (GTP-binding protein, 25 k Da) 
Ras homolog gene family, member A 
Ras-related C3 botulinum toxin substrate I 
Tubulin, beta polypeptide 
Cyclin HI 

Colony stimulating factor 2 (granulocyte-macrophage) 
Proteasome (prosome, macropain) subunit, beta type, 6 
Aldehyde dehydrogenase I family, member A I 

Glutathione 5-transCcrase M4 

Eukiiryotic translation initiation factnr 5A 
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0.92364 
0.90218 
0.84128 
0.84467 
0.91884 
0.00236 
0.84218 
0.06412 
0.00446 
-0.04884 
0.00364 
0.50402 
-0.34128 
0.01446 
0.026623 
0.094884 
0.12364 
0.010218 
0.041280 
0.044679 
0.031264 
0.04467 
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-0.17636 
-0.19782 
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-0.25778 
0.25644 
0.65533 
0.234987 
0.045116 
-0.47636 
-0.49782 
-0.05583 
0.255533 
-0.65116 
-0.64636 
-0.69782 
-0.75872 

-0.78533 
-0 978<n 
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0.91892 
0.89456 
0.86434 
0.85499 

0.76786 
0.86892 
0.69456 
0.16434 
0.85549 
0.86786 
0.31892 
0.79456 
-0.26434 
0.02549 
0.01 1 23 
-0.03214 
-0.08108 
0.60544 
0.84566 
-0.14501 
-0.13789 
-0.14501 
0.243214 
0.048108 
-0.50544 
-0.05356 
-0.53444 
0.053666 
0.054501 
0.876820 
0.283214 
0.088108 
-0.00544 
-0.03566 
0.145010 
0.232149 
-1.28108 
-1.30544 
0.03356 



0.134501 
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s£?uL^ s.gnificant .ncreased protein expression with increases in both genomic copy number and transcript 

r ^ ^ °f POtenl . 1 ^ t 'V nCOgCnC in lung ^^nonm «ILs. V. Spearman correlation coefficients between 

hl^jfcj t Ure bascd on al1 s,x ™ ncer «" linc * Md indicalcs '<°-°5, if r>0.84000. Mw, molecular weight; pA 



immunohislochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again that genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance or proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67. a known indicator of poor 
prognosis in lung cancer (Martin et al., 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
^expression of EEFI A2 and KC1P-1 proteins in stage I 
non-smaH-celJ lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
from those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes (PRDXL CALR^ and KCIP-J) have been im- 
plicated in lung tumorigenesis. PRDXI is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kim et al. (2003) found 
PRDXI expression to be elevated in both lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDXI may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2+ binding and the 
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Figure I Confirmation by Southern, Northern, and Western blot 
analyses of increased DNA copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, we arbitrarily chose one gene, NFKBI, in which an 
increased protein level did not correlate with genetic changes. The 
blotting results ;irc consistent with the results from the CGH array, 
trunscripi array, and protcomic analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated. at 
least three limes with each cell line. Means of normalized to 0-actin 
signal intensities on Southern, Northern, and Western blots, along 
with 95% confidence intervals, were calculated (/J-actin signals are 
not shown in the figure; two different normal bronchial epithelial 
cell lines were used in the confirmation and onlv one normal cell 
line is shown in the figure). 
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transcriptional regulation of other genes and was 
.recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCfP-J 
belongs to the 1 4-3-3 family, which participates via the 
MAPK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et a/., 
2005). KCIP-i was recently found to be expressed in all 
1 2 lung tumors tested in a single-institution study (Qi 
et oL 2005). Interestingly, EEFIA2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et a/., 2002); functional analyses have estab- 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEFIA2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the ■ 
>ower of our functional genomic strategy for rapidly 
Identifying potential oncogenes. 

Although the main focus of this study was to 
specifically identify putative oncogenes, it should be 



noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
translation*!, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Chen et at. y 2002). For example, 
NFKBI is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et al„ 2002). De- 
regulation of NFKBI is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et aL y 2002). Tn our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript levels, implying 
that at least some of these genes are 'passenger' genes 
that are concurrently amplified because of their 
location with respect to amplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo microarray we used consists of 
22000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence tags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
using microarrays with information on more genes 
and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
consequences of gene amplifications. Such expanded 
analyses will very likely lead to the identification of 
additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For example , genomic c o py number a n d protein l e v els 
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of KCTP-I were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et at., 2004) and 
proteomic analysis (Chen et al y 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, traTnscript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression of the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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Materials and methods 

Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, HI792, 
SK-UM, H522. and HI 563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 
Two norma l bronchinl cp i thc li u l cel l Iinc3 were obta i ned from 

i* I n I aaU /* Dii J ~ All.. /~** A T f f> A i v. . _ - . 



Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > I). 



Clonlcch (Palo Alto, CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells/ 

DNA and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization were performed as 
described previously (Barrett et aL 2004) with Agilent's 
Human I A Oligo Microarray (V2) (Agilent Technologies, 
Palo Alto. CA, USA), which, contains 22000 unique 60-mer 
ohgos. Details of the protocol for analysing transcripts are 
available at lutp://www.chem.agilenl.com. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene cluster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilcntCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from normal control cell hybridization, as described elsewhere; 
Barrett et aL 2004) were interpreted as evidence of true 
differences in DNA copy number. The criteria for defining 
genomic ovcr-rcprcscntation and amplicons are described 
elsewhe re (Hvnum *■/ aL 2002): details are given in the. 

Oncogene 



Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two-dimensional PAGE and their 
idenuncation oy mass spectrometry were performed as 
previously described (Shen et aL 2004). Briefly, protein pellets 
were solubihzed in rehydration buffer, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad's PDQuest 2D gel image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRQ matrix-assisted 
laser desorptipn ionization/time-of-fiight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense monoisotopic peaks for each sample was 
entered in the MS-Fit search program (v3.2.l) (http:// 
prospector.ucsf.edu/ucsfhtml4.0/msfit.htm) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern, Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA. USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDXI, CALR, 
NFKB I, KCIP-I. and /f-actin from Santa Cruz Biotechnology 
(Santa Cruz. CA. USA); and EEFIA2 from Upstate Biotech- 
nology (Wa I thanv MA. USA). 

Fluorescence in situ hybridization and immunohistochemical 
analyses of lung tissue microarrays 

Fluorescence in situ hybridizations and immunohistochemical 
analyses of KCIP-I and EEFI A2 were carried out as described 
elsewhere (Jiang et aL 2002; Wang et aL 2005) with Lung 
Tissue Microarrays (Amnion, Austin, *TX, USA) and II 
homemade microarray blocks containing tissue samples from 
1 1 3 patients with pathologic stage I non-small-ccll lung cancer 
(Wang et aL 2005). DNA probes specific for KC1P- 1 and 
EEFIA2 were obtained by screening a Human BAC Clone 
library (Invitrogcn) by polymerase chain reaction as described 
previously (Jiang et al.. 2002). The antibodies used for the 
immunohistochemical analyses were the same as those used 
for the Western blotting. Cell proliferation of the lung tissues 
was assessed with a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transfection, cellular proliferation assay, and apoptosis 
analysis 

Transfcclions were carried out by using siPORT Lipid 
Transfcclion Agent (Ambion) with siRNAs targeting KCIP-I 
or EEFIA2 or with a scrambled siRNA duplex (siControl) 
(Dharmacon Inc., Lafayette, CO, USA), with PBS used as a 
negative control (Jiang et aL 2002). Cells were fixed 24, 48, or 
96 h later and subjected to further tests. All siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of cell growth by the 
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siRNAs was determined by MTT staining, and cell growth 
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Amplification and overexpression of putative oncogenes 
confer growth advantages for tumor development. We 
used a functional genomic approach that integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines. Next, we used two-dimensional polyacrylamide gel 
electrophoresis and mass spectrometry to identify 42 
proteins that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDX1, 
EEF/A2 9 CALR, and KCIP-I - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r>0.84, two- 
sided r><0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EEFIA2 and KCIP-1 expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunohistochemical analyses of EEFIA2 and KCIP-! in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
protein expression for both ^n** *nA tu<^ protein 
overexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEFIA2 and KCIP-J and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma. 
Oncogene (2006) 25, 2628-2635. doi: 1 0. 1 038/sj.onc. 1 209289; 
published online 12 December 2005 
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Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage, the ability to identify 
putative oncogenes that are activated during tumorigen- 
esis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER-2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, HER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et ai, 2001; Jiang 
et aL, 2004; Tonon el aL, 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
a n d mRN A leve l s in cDNA microarrays perm i ts 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et aL t 2002; Pollack et a/., 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of proto- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate proteins and post- 
transcriptional mechanisms (Chen et at., 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over-represented but are not biologically 
relevant in tumor development. 



